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ACES  PRESIDENT  S  STATEMENT 


Were  it  not  for  the  ACES  Annual  Reviews,  it  would  be  difficult  to  justify  being  in  Monterey  In  March.  (This 
coming  from  one  who  spent  the  Winter  of  '95-'96  in  the  MidwestI)  But  it  is  time  for  ACES’96,  so  here  we  are 
in  Monterey,  once  again.  And  we  think  that  this  will  be  the  best  review  yet. 

In  addition  to  our  regular  staple  of  high-quality  technical  papers,  and  our  world-renowned  short-courses, 
we  are  adding  anew  feature  this  year  —  an  amateur  radio  session.  Judging  from  the  number  of  interesting 
questions  and  applications  described  in  the  NEC-LIST,  amateur  radio  operators  are  a  natural  part,  or  should 
be,  of  the  ACES  community.  Perry  Wheless  is  to  be  commended  for  assembling  this  session,  which  appears 
to  be  well  populated. 

Our  congratulations  go  to  Dick  Gordon,  who  is  the  Technical  Program  Chairman,  and  his  colleagues,  Eric 
Michielssen,  Jin-Fa  Lee,  Robert  Lee,  W.  Elliott  Hutchcraft,  Bob  Bevensee,  and  Dick  Adler  for  organizing  and 
managing  the  12th  Annual  Review.  Their  hardwork  is  another  manifestation  of  the  can-do  spirit  of 
dedicated  volunteers  In  a  voluntary  society. 

There  must  be  more  of  you  out  there  who  are  ready  to  step  fonvard  and  offer  your  time  and  services  to  ACES. 
Call  me!  The  Society,  the  Profession,  and  you  will  benefit  enormously. 

Enjoy  the  12th  Annual  Review. 

Harold  A.  Sabbagh 
Sabbagh  Associates,  Inc. 

4635  Morningside  Drive 
Bloomington,  IN  47408 
(812)  339-8273 
(812)  339-8292  FAX 
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1300- 1630  "Application  of  Modem  Analytical  and  Hybrid  Tools  for  Antenna 
Modeling  and  Synthesis", 

by  Roberto  Rojas  and  Prabhakar  Pathak,  The  Ohio  State  University. 
FRIDAY  MARCH  22  FULL-DAY  COURSES 


0830-1630  "Finite  Element  Methods  for  Electromagnetics" 
by  John  Volakis,  University  of  Michigan  and 
John  Brauer,  Mac-Neal-Schwendler  Corporation. 

0830-1630  "Conformal  Time  Domain  Numerical  Electromagnetics" 
by  Kane  Yee,  Lockheed. 


FRIDAY  MARCH  22  HALF-DAY  COURSES 


0830-1200  "Using  Model-Based  Parameter  Estimation  to  Increase  the  Efficiency 
and  Effectiveness  of  Computational  Electromagnetics" 
by  Ed  MiUer. 


1300-1630  "Antenna  Properties  in  Linear  and  Nonlinear  Environments" 
by  Robert  Bevensee,  BOMA  Enterprises. 
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FINAL  AGENDA 


The  Twelfth  Annual  Review  of  Progress  in  Applied  Computational  Electromagnetics 

NAVAL  POSTGRADUATE  SCHOOL 
18-22  MARCH  1996 

Richard  Gordon,  Technical  Program  Chairman 
Eric  Michielssen,  Conference  Co-Chair 
Jin-Fa  Lee,  Conference  Co-Chair 
Robert  Lee,  Short  Course  Chairman 
W,  Elliott  Hutchcraft,  Technical  Assistant 
Robert  Bevensee,  Assistant  Conference  Co-Chair 
Richard  W.  Adler,  Conference  Facilitator 


MONDAY  MORNING  18  MARCH  1996 

0745-0830  SHORT  COURSE  REGISTRATION 

0830-1200  SHORT  COURSE  (HALF-DAY) 

"An  Application  Oriented  Introduction  to  the  NEC-BSC  Workbench" 

R.J.  Marhefka  &  L.W.  Henderson,  The  Ohio  State  University 

0830-1630  SHORT  COURSE  (FULL-DAY) 

'Wavelets:  Theory,  Algorithms,  and  Applications" 

A.K.  Chan,  Texas  A&M 

0830-1 630  SHORT  COURSE  (FULL-DAY) 

"Using  Mathematical  Sofhware  for  Computational  Electromagnetics" 

J.  Lebaric,  Naval  Postgraduate  School 

0830-1 600  SHORT  COURSE  (FULL-DAY 

"Practical  EMI/EMC  Design  and  Modeling" 

T.  Hubing,  University  of  Missouri-Rolla 

0900-1200  REGISTRATION 

MONDAY  AFTERNOON 

1 300-1 630  SHORT  COURSE  (HALF-DAY) 

"Application  of  Modern  Analytical  and  Hybrid  Tools  for  Antenna  Modeling  and  Synthesis" 
R.  Rojas  &  P.  Pathak,  Ohio  State  University 


1630-1900  REGISTRATION 
1700-1915  AMATEUR  RADIO  DINNER 

SESSION  0:  AMATEUR  RADIO  SESSION 
Chair:  W.P.  Wheless,  Jr. 


1930 

1950 

2010 

2030 

2050 

2110 

2130 

2150 

2210 

2230 


"Ground-Plane  Antennas  with  Elevated  Radial  Systems" 

"Review  of  Characteristics  for  HF  Dipole  Antennas  Including  the  Cases  Where  the  Dipoles 
are  Above  and  Parallel  to  the  Surface  of  Real-World  Grounds" 

"HF  Multi-Frequency  Antennas  Using  Coupled  Resonators" 

"The  Optimized  Wideband  Antenna  (OWA)  and  its  Application" 

'The  Quad-Rhomb  Antenna  -  A  New  All  Band  Antenna  for  Amateur  Radio  Applications" 

BREAK 

"SKY-WAVES-95" 

"Using  Ham  Radio  CEM  Codes  to  Gain  Insight  to  VHF  Ground  Plane  Antennas  and  to 
Mitigate  75  Meter  Mars  RFI  at  a  Naval  Receiving  Site" 

"Development  of  Practical  Landstorfer  Antennas  for  Amateur  Use" 

'Two-Port  Network  Specification  of  Baiuns  for  NEC  Analysis  and  Other  Applications” 


103  Glasgow  Hall 
102  Glasgow  Hall 

122  Ingersoll  Hall 


419  Spanagel  Hall 
119  Ingersoll  Hall 

103  Glasgow  Hall 

102  Glasgow  Hall 

103  Glasgow  Hall 

122  Ingersoll  Hall 

J.S.  Belrose 
G.M.  Royer 

G  A.  Breed 
J.K.  Breakall 
R.  Anders 

R.  Anders 

A.  Hoffman  &  R.W.  Adler 

M.C.  Tarplee 

W.P.  Wheless,  Jr., 

&  C.S.  Wheless 
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TUESDAY  MORNING  19  MARCH  1996 
0700-0800  REGISTRATION 
0700-0745  CONTINENTAL  BREAKFAST 

0730  ACES  BUSINESS  MEETING  President  Hal  Sabbagh 

0745  WELCOME  Richard  Gordon 

SESSION  1:  HIGH  FREQUENCY  METHODS  (Parallel  with  Sessions  2, 3, 4,  &  5) 
Chair:  R  J.  Burkholder 


103  Glasgow  Hall 
Glasgow  Hallway 
102  Glasgow  Hall 
102  Glasgow  Hall 
102  Glasgow  Hall 


0800 

0820 

0840 

0900 

0920 

0940 

1000 

1020 

1040 

1100 

1120 

1200 

SESSION  2: 

0800 

0820 

0840 

0900 

0920 

SESSION  3: 

0940 

1000 

1020 

1040 

SESSION  4: 
1100 

1120 

1140 

SESSION  5: 


"Physical  Theory  of  Diffraction  Analysis  of  Impedance  Structures" 

"Hybrid  SBR/GTD  Radio  Propagation  Model  for  Site-Specific  Predictions  in  an  Urban 
Environment" 

"Analysis  of  Dielectric  Structures  Using  the  NEC-BSC" 

"Hybrid  MM-PO-Fock  Analysis  of  Monopole  Antennas  Mounted  on  Curved  Convex  Bodies" 

"Numerical  Diffraction  Coefficient  for  an  Impedance  Wedge  with  a  Material  Body  Attached 
to  its  Edge” 

"Reflection  and  Diffraction  of  Well-Focussed  General  Astigmatic  EM  Gaussian  Beams" 

BREAK 


H.H.  Syed  &  J.L  Volakis 
J.  Schuster  &  R.  Luebbers 

R.  Marhefka  &  L.  Henderson 
U.  Jakobus  &  F.M.  Landsforfer 
M.F.  Otero  &  R.G.  Rojas 

G.  Zogbi,  H.T.  Chou, 

P.H.  Pathak,  &  R.J.  Burkholder 


"Polarized  Scattered  Light  by  a  Semicylindrical  Boss  on  a  Conducting  Flat  Plane" 

"Divergence  of  Rays  in  Modulated  Atmospheric  Ducts" 

"Diffraction  by  a  Weak  Dielectric  Wedge" 

"Far-field  Diffraction  Effects  of  EUV  Fresnel  Zone  Plates" 

LUNCH 

INVERSE  SCATTERING  (Parallel  with  Sessions  1,  3, 4,  &  6) 

Chairs:  P.M.  Goggans  and  L.  Riggs 

"Radar  Time  and  Frequency-Domain  Received  Signals  for  Realistic  Antennas  and  Scatterers" 
'The  Extraction  of  Scattering  Mechanisms  from  Measured  Data" 

"Using  the  E-pulse  Technique  and  Hypothesis  Testing  to  Perform  Radar  Target  Identification" 
"A  Boundary-Integral  Code  for  Electromagnetic  Nondestructive  Evaluation" 

'The  Numerical  Analysis  of  Planar  Antennas  Buried  in  Layered  Media" 

RCS  ANALYSIS  (Parallel  with  Sessions  1,  2,  4,  &  5) 

Chair:  M.  El-Shenawee 

"A  Response  Surface  Methodology  Study  of  Electromagnetic  Data  Compression  and 
Reconstruction" 

BREAK 


H. A.  Yousif 

I. P.  Zolotarev 
A.V.  Popov 

Y.V.  Kopylov,  V. A.  Baranov, 

A  V.Popov,  &  A  Vinogradov 

Engr  Auditorium 

P.M.  Goggans  &  J.D.  Pursel 
H.M.  Chizever  &  K.M.  Pasala 
L.  Riggs,  J.  Mooney,  &  C.  Smith 
K.  Murphy  &  H.A.  Sabbagh 

J.  van  Tonder,  J.  Cloete, 

&  D.  Davidson 

Engr  Auditorium 

V.M.  Floyd,  Jr.,  A.  Terzuoli,  Jr., 
G.C.  Gerace  &  P.F.  Auclair 


"Curvilinear,  Isoparametric  Modelling  for  RCS  Prediction,  Using  Time  Domain  Integral  S.P.  Walker.  M.J.  Bluck, 

Equations”  M.D.  Pocock,  C.Y.  Leung, 

&  S.J.  Dodson 

"Double  Scatter  Radar  Cross  Sections  for  Two  Dimensional  Random  Rough  Surfaces  that  M.  El-Shenawee  &  E.  Bahar 
Exhibit  Backscatter  Enhancement" 


APPLICATIONS  OF  PARALLEL  COMPUTING  (Parallel  with  Sessions  1 ,  2,  3  &  5)  Engr.  Auditorium 

Chairs:  L.  Epp  and  K.  Naishadham 

"Solution  of  Electromagnetic  Eigenproblems  on  Multiprocessor  Superscalar  Computers"  M.P.  Debicki,  P.  Jedrzejewski, 

J.  Mielewski,  P.  Przybyszewski, 
&  M.  Mrozowski 

"Implementation  of  Hybrid  FDTD/FVTD  Conformal  Algorithm  on  a  Massively  Parallel  Computer”  J.S.  Chen  &  A.A.  Seidl 


"Parallel  CARLOS-3D  Code  Development" 


J.M.  Putnam  &  J.D.  Kotulski 


NEW  DEVELOPMENTS  IN  TLM  MODELING  (Parallel  with  Sessions  1, 2,  3,  &  4)  122  Ingersoll  Hall 

Chair:  W.J.R.  Hoefer 


0800 

0820 

0840 


"On  the  Advantages  of  ATLM  Over  Conventional  TLM"  M.  Krumpholz  &  P.  Russer 

"Advanced  Node  Formulations  in  TLM  -  The  Matched  Symmetrical  Condensed  Node  (MSCN)"  V.  Trenkic,  C.  Christopoulos, 

&  T.M.  Benson 

"A  General  and  Complete  Two-Dimensional  TLM  Hybrid  Node  Formulation  Based  on  N.  Pena  &  M.M.  Ney 

Maxwell's  Integral  Equations" 
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TUESDAY  MORNING  19  MARCH  1996 

SESSIONS:  NEW  DEVELOPMENTS  IN  TLM  MODELING  (Parallel  with  Sessions  1,  2,  3,  &  4)  (cont) 

0900  "A  General  Formulation  of  a  Three-dimensional  TLM  Condensed  Node  with  the  Modeling  of 

Electric  and  Magnetic  Losses  and  Current  Sources" 

0920  "A  Numerical  Comparison  of  Dispersion  in  Irregularly  Graded  TLM  and  FDTD  Meshes" 

0940  "Accuracy  Considerations  of  a  Class  of  Frequency-Domain  TLM  Nodes" 

BREAK 

1020  "Distributed  Simulation  of  Planar  Circuits  by  TLM  Method  in  a  Parallel  Computing" 

Environment" 

1040  "Modeling  Gyromagnetic  Media  in  Symmetrical  Condensed  Node  TLM" 

1 1 00  "A  Comparative  Performance  Study  of  Absorbing  Boundary  Conditions  in  TLM  and  FDTD" 

1120  "Modeliing  of  Coplanar  Waveguide  Discontinuities  Using  the  Alternating  Transmission  Line 

Matrix  (ATLM)  Method" 

1 1 40  "Quasi-Static  Correction  of  a  Knife  Edge  Corner  in  2D  TLM  Algorithm" 


122  Ingersoll  Hall 
N.  Pena,  &  M.M.  Ney 

F.J.  German,  J.A.  Svigelj, 

&  R.  Mittra 

S.  Chen  &  R.  Vahidieck 

B,  Isele,  J.  Schmoller, 

&  P.  Russer 

L,  de  Menezes  &  W.J.R.  Hoefer 

C.  Eswarappa  &  W.J.R.  Hoefer 
B.  Bader  &  P.  Russer 

L  Cascio,  G.  Tardioli,  T.  Rozzi, 
&  W.J.R,  Hoefer 


1200  LUNCH 

1 200  BOARD  OF  DIRECTORS  MEETING/LUNCHEON 

TUESDAY  AFTERNOON 

1400-1800  VENDOR  EXHIBITS  AND  STARTING  AT  1600  -1800,  WINE  AND  CHEESE  BUFFET 
SESSION  6:  INTERACTIVE  TECHNICAL  SESSION, 

1400-1800  "Electromagnetic  Visualization  Using  Commercial  Software" 

"Performance  of  Multiple,  Thin  Layers  of  Lossy  Dielectrics  as  Broadband  Attenuators" 


DelMonte  Room, ,  Herrmann  Hall 

Ballroom,  Herrmann  Hall 
Ballroom,  Herrmann  Hall 
H.A.  Nott 

G.W.  Jarriel,  Jr,,  M  E.  Baginski, 
&  L.S.  Riggs 


"Research  &  Engineering  Framework  (REF)  Data  Dictionary  Specification  for  Computational 
Electromagnetics" 

"Development  of  an  Electromagnetic  and  Mechanical  Simulation  Tool  for  the  Computer 
Modeling  oftheTACAMO  LFA/LF  Communication  System" 

"A  New  Look  at  Antenna  Traps" 

"Imaging  of  Conductive  and  Ferromagnetic  Materials  Using  a  Magnetic  Induction  Technique' 


J.A.  Evans 


M.C.  Longtin,  R.W.  Sutton, 
K.J.  Laskey,  &  P.J.  Morrison 

P.W.  Leonard 

J.  Ferreira,  F.  Unbares, 

J.  Velez,  J.  de  Ribomar  S. 
Oliveira,  &  A.R.  Borges 


"Investigation  of  the  Properties  of  Wavelet-Like  Basis  Functions  in  the  Finite  Element  Analysis  L.A,  Harrison  &  R.K.  Gordon 
of  Elliptic  Problems" 


"Continuing  Development  of  the  Research  and  Engineering  Framework  (REF)  for 
Computational  Electromagnetics" 


L.W.  Woo,  B.  Hantman, 

K.  Siarkiewicz,  J.  LaBeile, 
&  R.  Abrams 


"Numerical  and  Experimental  Modelling  of  Liquid  Dielectrics  Using  a  Coaxial  Cavity"  M.  Single,  D.B.  Davidson, 

&  J.H.  Cloete 


"Hardware/Software  Codesign  Model  forXPATCHF  Optimization" 
"3D  FDTD  Simulation  of  EM  Detection  of  Buried  Waste" 


B.A.  Kadrovach,  T.S.  Wailes, 
A.J.  Terzuoli,  Jr.,  &  D.S.  Gelosh 

D.  Sullivan,  B.  Hansen 
&  N.  Skousen 


"Application  of  Digital  Filters  to  the  Construction  of  Wideband  Dispersive  Boundary  Conditions"  M.  Mrozowski,  M.  Niedzwiecki, 

&  P.  Suchomski 

"Note  on  Large  Crane  Coupling  to  Nearby  AM  Radio  Stations"  P.W.  Leonard  &  J.B.  Hatfield 

"XPATCHF  Software  System  Analysis  and  Profiling"  B.A.  Kadrovach,  T.S.  Wailes, 

A.J.  Terzuoli,  Jr,,  &  D.S.  Gelosh 


'Theoretical  Studies  of  Photonic  Band  Gap  Materials" 

"On  the  Use  of  Richardson  Extrapolation  in  the  Finite  Element  Analysis  of  Two-Dimensional 
Electrostatics  Problems" 

"Scattering  from  Chirally  Coated  Bodies" 


M.  Sigalas,  R.  Biswas,  C.  Chan, 
K.  Ho,  &  C.  Soukoulis 

W.E.  Hutchcraft  &  R.K,  Gordon 
R,  Sharma  &  N.  Balakrishnan 


XX 


TUESDAY  AFTERNOON  19  MARCH  1996 


1400-1800  VENDOR  EXHIBITS  AND  STARTING  AT  1600  -1800,  WINE  AND  CHEESE  BUFFET 
SESSION  6:  INTERACTIVE  TECHNICAL  SESSION,  (cont) 

“A  Mixed  Formulation  to  Compute  the  Source  Current  Density  in  Inductors  of  Any  Shape" 


"High  Power  Microwave  Amplification  for  High-Intensity  Relativistic  Electron-Beam 
Storage-Rings" 

"Real-Time  Digital  Signal  Processor  in  Ionosphere  Measurements" 

"High  Frequency  Electromagnetic  Safety  Analysis  by  Numerical  and  Empirical  Methods  on 
Mobile  Platforms" 

"Computational  Modeling  of  Wave  Plasma  Interaction" 

"Attenuation  of  HF  Radio  Waves  in  a  Forest:  Results  from  Experiment" 


Ballroom,  Herrmann  Hall 
Ballroom,  Herrmann  Hall 

F.  Robert,  P.  Dular, 

J.F.  Remade,  M.  Um6, 

&  W.  Legros 

R.A.  Spedale 


A.L.  Karpenko  &  V.V.  Koltsov 
M.J.  Packer,  &  R.C.  Ferguson 

V.A.  Eremenko  &  Y.Cherkashin 

I.P.  Zolotarev.  V.A.Popov 
&  V.P.  Romanuk 


"Statistical  Reflection  Properties  of  Electromagnetic  Monopulse  by  Buried  Object  in 
Subsurface  Random  Ground  Using  FDTD" 

"Running  NEC4  on  the  Cray  at  NPS" 

1730  NO  HOST  BAR 

1830  AWARDS  BANQUET 

WEDNESDAY  MORNING  20  MARCH  1996 

0715  CONTINENTAL  BREAKFAST 

SESSION  7:  FDTD  APPLICATIONS  AND  ENHANCEMENTS  (Parallel  with  Sessions  8,  9, 10, 11  &  12) 
Chair:  J.H.  Beggs 


Y.  Miyazaki  &  Y.  Jyonori 
B.  Neta 

Terrace  Room 
Ballroom,  Herrmann  Hall 


102  Glasgow  Hall 


0800 

0820 

0840 

0900 

0920 


"UHF/VHF  Propagation  Model  Characterization  Over  Imegular  Terrain  Using  MOM/FDTD"  K.A.  Lysiak,  J.K.  Breakall, 

&  J.  Zmyslo 

"Validation  of  FDTD  Modeling  of  Ground-Penetrating  Radar  Antennas"  B.J.  Zook 

"FDTD  Analysis  of  Radiation  from  a  Lens  Terminated  Conical  TEM  Antenna"  S.A.  Blocher,  E.A.  Baca, 

&  T.S.  Bowen 

"FDTD  Analysis  of  a  Dipole  Antenna  Driven  from  Various  Excitation  Sources"  M.R.  Zunoubi.  N.H.  Younan 

C.D.  Taylor,  &  J.H.  Beggs, 

"An  Efficient  Hybrid  PEE-FDTD  Field  Modeling  Technique  in  Cylindrical  Coordinates"  M.  Mrozowski,  M.  Okoniewski, 

&  M.A.  Stuchly 


1000 

SESSION  8: 

1020 

1040 

1100 

1120 

1200 

SESSION  9: 

0800 

0820 

0840 

0900 

0920 


BREAK 

FINITE  ELEMENT  AND  FINITE  VOLUME  METHODS  FOR  ELECTROMAGNETIC  FIELD  SIMULATION  102  Glasgow  Hall 
Chairs:  R.  D-Edlinger  and  R.  Lee  (Parallel  with  Sessions  9, 10, 11  Sc  12, 

"Local  Tetrahedron  Modeling  of  Microeledronics  Using  the  Finite-Volume  Hybrid-Grid  D.J.  Riley  &  C.D.  Turner 

Technique" 

"Full  Wave  Vector  Maxwell  Equation  Modeling  of  Self-Limiting  Effects  and  Optical  Nonlinear  S.V.  Polstyanko  &  J-F.  Lee 
Vortices” 

"A  Hybrid  FEM-FMM  Technique  for  Eledromagnetic  Scattering"  S.  Bindiganavale  &  J.L  Voiakis 

"Finite  Element  Method  Analysis  of  the  Celestron-8  Telescope"  R.R.  DeLyser  &  H.  Pohle 

LUNCH 

NUMERICAL  ERROR  ANALYSIS  AND  CONTROL  I  (Parallel  with  Sessions  7  8, 10, 11,  &  12)  122  Engersoll  Hall 
Chair  J.L.  Voiakis 

"Error  Analysis  in  the  Adaptive  Integral  Method  (AIM)"  E.  Bleszynski,  M.  Bleszynski, 

&  T.  Jaroszewicz 

"Using  Model-Based  Parameter  Estimation  to  Estimate  the  Accuracy  of  Numerical  Models"  E.K.  Miller 

"Guidelines  for  Using  the  Fast  Multipole  Method  to  Calculate  the  RCS  of  Large  Objects”  S.S.  Bindiganavale 

&  J.L  Voiakis 

"Developments  in  Error  Estimation  for  Covolume  and  Staggered  Mesh  Approximations  to  R.A.  Nicolaides  &  D-Q.  Wang 
Maxwell's  Equations" 

'Adaptive  Methods  for  the  Numerical  Solution  of  Readion-Diffusion  Problems"  D.J.  Estep,  M.G.  Larsson 

&  R.D.  Williams 

"Error  Estimates  for  Subg ridded  FDTD  Schemes" 

BREAK 


0940 

1000 


XXI 


P.  Monk 


WEDNESDAY  MORNING  20  MARCH  1996 

SESSION  10:  NUMERICAL  LINEAR  ALGEBRA  IN  COMPUTATIONAL  ELECTROMAGNETICS 
Chain  A  S.  Model  (Parallel  with  Sessions  7,  8,  9, 11,  &  12) 

1 020  "Applications  of  Numerical  Linear  Algebra  in  Electromagnetics" 

1040  "Multilevel  Preconditioning  for  the  Time-Harmonic  Maxwell  Equations" 

1 1 00  "Iterative  Solution  Methods  for  ill-Posed  Problems" 

1120  "Methods  for  Large  Sparse  Eigenvalue  Problems  from  Waveguide  Analysis" 

1 1 40  “Iterative  Solution  of  Field  Problems  in  Space-Decoupled  Configurations" 


122  Ingersoli  Hall 

G.K.  Gothard,  J.H.  Henderson, 
&  A.S.  Model 


D.  Calvetti,  L.  Reichel, 
&  Q.  Zhang 

C.  Peng  &  D.  Boley 
G.  Burger,  &  H.  Singer 


SESSION  11:  NEC  APPLICATIONS  (Parallel  with  Sessions  7,  8,  9, 10.  8i  12) 

Chairs:  M.  Ney  and  C,  Christopoulos 

0800  "Numerical  Investigation  of  Antennas  for  Hand-Held  Radiotelephones  Using  NEC  Code" 

0820  "Evaluation  of  the  Discrete  Complex-Image  Method  for  a  NEC-Like  Moment-Method  Solution" 

0840  'The  Improvement  of  NEC-2's  Out-of-Core  Operation  and  the  Analysis  of  UHF  Monopole 

Antenna  Mounted  on  a  Car  Model" 

0900  "MatNEC:  A  MATLAB  Based  Graphical  Interface  to  SuperNEC" 


Engr.  Auditorium 

A,A.  Efanov,  M.S.  Leong, 
&  P.S.  Kooi 


K.  Natsuhara,  T.  Suda 
Y.  Kazama,  &  K.  Madono 


SESSION  12:  VALIDATION  (Parallel  with  Sessions  7.  8.  9, 10,  &  11) 

Chair:  D.R.  Pflug 

1020  "First  and  Second  Generation  Transformable  Scale  Aircraft-Like  Models  for  Code 

Validation;  Present  Results  and  Future  Plans" 

1040  "Software  for  Modeling  Helix  Antennas  with  NEC  and  Validation  by  Measurement" 

1100  "Evaluation  of  the  Sensitivity  of  Scattering  Predictions  to  Uncertainties  in  Material 

Characteristics" 

1 1 20  'Validation  of  the  PO-based  RCS  Code  SIGMA  by  Using  lEM  and  Experiments" 


Engr.  Auditorium 


D.R.  Pflug,  T.W.  Biocher 

D. E.  Warren,  &  D.O.  Ross 

C.W.  Trueman,  N.  Sultan, 

S.J.  Kubina,  &  T.  Pellerin 

G.A.  Barnhart, 

A.J.  Terzuoli,  Jr.,&  G.C.  Gerace 

E.  Kemptner,  D.  Klement, 

&  V.  Stein 


OPTIMIZATION  (Parallel  with  Session  15  &  16), 

Chair:  E.  Michielssen 

"Optimization  of  Broad-Band  Loaded  Wire  Antennas  in  Real  Environments  Using 
Genetic  Algorithms" 

"Genetic  Algorithms  Based  Pattern  Synthesis  Approach  for  Arbitrary  Array  Designs" 
"Speeding  Convergence  of  Genetic  Algorithms  for  Optimizing  Antennas  Arrays" 
"Order  Recursive  Method  of  Moments  for  Iterative  Design  Application" 


Engr.  Auditorium 

D. S.  Weile, 

E.  Michielssen,  &  A.  Boag 
Y.  Lu  &  K.K.  Yan 


K.  Naishadham  &  P.  Misra 


MULTIPOLE  TECHNIQUES  Engr.  Auditorium 

Chair:  P.  Leuchtmann 

"Discrete  Sources  Method  for  the  Silicon  Wafers  Defect  Discrimination"  Y.  Eremin  &  N.V.  Orlov 

"Iterative  Scheme  of  Discrete  Sources  Amplitudes  Determination  Based  on  D-matrix  Approach"  Y.  Eremin 
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Introduction 

From  the  earliest  days  of  radio  the 
merits  of  elevated  counterpoise  and  radial 
systems  have  been  recognized  because  of 
the  way  in  which  current  densities  in  the 
ground  are  more  or  less  uniformly 
distributed  over  the  area  of  the  insulated 
counterpoise  (which  is  in  effect  a  large 
capacitance  ground),  c.f.  Fig.  1.  This 
figure  shows  wires  running  radially 
outwards  on  insulated  supports  without 
connection  being  made  to  earth  plates  or 
ground  stakes  at  the  outer  ends  of  the 
system.  An  alternate  view  on  the  way  an 
elevated  radial  system  works  is  that  it 
allows  the  collection  of  electromagnetic 
energy  in  the  form  of  displacement 
currents,  rather  than  conduction  currents 
flowing  through  lossy  earth.  Certainly 
when  only  a  few  elevated  radial s  are  used, 
we  cannot  consider  the  counterpoise 
forming  a  large  capacitance  ground. 

The  development  of  such  ground 
systems  was,  by  necessity,  empirical  in 
nature,  as  sophisticated  instrumentation 
and  standardized  antenna  testing 
procedures  were  not  available  in  the  early 
days  of  radio.  And,  after  about  1937,  the 
use  of  elevated  radial  systems  became  an 
almost  a  forgotten  art,  because  of  research 
by  Brown  et.  al.  [1937]  in  favor  of  the 
buried  radial  system.  These  authors 
carried  out  a  very  extensive  measurement 
program  to  determine  the  radiation 
efficiency  of  a  vertical  monopole  as  a 
function  of  ground  system  parameters.  A 
result  of  their  work  leads  to  their 
recommendation  that  120  radial  wires 
each  one  half  a  wavelength  long  should 
be  used  to  maximize  antenna  efficiency. 


In  the  light  of  what  we  know  today 
[Burke  and  Miller,  1989]  it  is  surprising 
that  empirical  design,  tradeoffs  and 
optimization  procedures  did  not  emerge, 
since  the  performance  of  thousands  of  MF 
broadcast  stations  has  been  measured,  but 
most  stations  simply  used  ground  systems 
based  on  this  recommendation.  Also 
surprising,  is  that  while  radial  systems  are 
conventionally  buried,  some  stations  have 
used  insulated  radials,  and  ground 
conductivies  are  very  variable  from  one 
site  to  another,  yet  radial  anti-resonant 
effects  have  apparently  not  been  found, 
or  were  missed. 

VHF/UHF  monopole/ground  plane 
(GP)  type  antennas,  e.g.  a  quarter 
wavelength  vertical  radiator  element  with 
four  radial  rods,  have  been  used  for 
communications  for  decades.  Terman 
[1943]  describes  such  an  antenna  as  one  of 
a  class  of  flagpole  antennas.  Vertical 
antennas  with  a  few  roof  top  mounted 
quarter  wavelength  radial  wires  (a  few 
for  several  bands)  are  a  popular  choice 
for  amateur  communications,  particularly 
on  the  lower  HF  bands.  These  antennas 
are  simple  to  erect,  easy  to  tune,  and  are 
relatively  unobtrusive. 

There  is  still  confusion  between  the 
true  vertical  monopole,  with  its  'infinite' 
ground  plane,  the  real  Earth,  usually 
assisted  by  a  number  of  buried  radials,  and 
the  elevated  ground  plane  with  horizontal 
wire  radials  or  rods  sometimes  described  as 
a  'non-radiating  artificial  ground  plane'. 
But  four  radial  rods  used  with  a  monopole 
antenna  can  hardly  be  considered  to 
simulate  a  true  ground  plane.  Moxon, 
G6XN  [1992]  considers  that  the  elevated 
ground  plane  antenna  with  two,  three  or 
four  quarter  wavelength  wire  radials 
needs  to  be  considered  as  a  form  of  dipole, 
rather  than  a  vertical  ground-plane  type 
monopole  antenna.  Certainly  there  are  a 
number  of  differences.  A  monopole 
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antenna  with  four  horizontal  radial  rods 
at  a  height  of  several  wavelengths  has  an 
input  impedance  of  22-ohms,  and  can  have 
a  vertical  dipole  like  pattern.  A  ground 
plane  antenna  on  a  square  or  circular 
metal  sheet,  of  sufficient  size,  has  an  input 
impedance  of  36-ohms,  and  while  the 
maximum  gain  can  approximate  that  for  a 
monopole  on  an  infinite  metal  sheet,  for 
ground  planes  of  practical  size  there  is  a 
pattern  null  on  the  horizon  (the  plane  of 
the  finite  ground-plane),  c.f.  Foster  and 
Miller  [1981]. 

One  has  also  to  be  careful  with  GP 
type  antennas  that  are  at  a  height  of  a 
wavelengths  or  more,  and  are  fed  with 
coax.  Tilson  and  Secord  [1967]  showed  that 
the  measured  vertical  radiation  pattern 
VRP,  of  VHFAJHF  ground  rod  antennas 
varies  markedly  with  changes  in  element 
lengths  and  the  degree  to  which  RF 
currents  can  be  kept  off  the  outer  braid  of 
the  coaxial  feed  line.  These  authors 
showed  some  typical  VRPs  for  various 
ground  rod  antennas.  The  VRP  that  looked 
dipole  like  had  0.237X  radials  and  a 
monopole  height  of  0.25X,  with  chokes  on 
the  feedline. 

Elevated  counterpoise  type  of 
ground  systems  were  extensively 
experimentally  studied  by  Doty,  Frey  and 
Mills  [1983].  The  radial  systems  they  used 
were  not  unlike  that  shown  in  Fig.  1,  i.e. 
the  counterpoise  wires  filled  a  square  or 
rectangular  area  over  the  ground,  rather 
than  being  of  equal  or  a  resonant  length. 
Christman,  KB8I  [1988]  was  perhaps  the 
first  to  show  by  numerical  modeling  that  a 
vertical  monopole  with  a  few  (four)  radial 
wires  could  be  at  a  height  quite  low  over 
real  ground  before  the  radiation 
efficiency  of  the  antenna  was 
significantly  degraded.  Unfortunately  the 
topic  of  ground  mounted  and  elevated 
radials,  and  antennas  with  elevated  feed, 
has  remained  clouded  by  controversy,  e.g„ 
recent  posting  on  the  INTERNET  News 
Group  on  Radio  Amateur  Antenna;  and,  c.f. 
Belrose  [1995]. 


Continuing,  if  four  elevated  radials 
can  be  used  to  realize  performance  for  a 
monopole  comparable  with  many  buried 
radials,  then  elevated  radials  can  be  used 
for  real  antennas;  and  as  well  to  simulate 
ground  ’connection’  for  numerical 
modeling  programs  like  NEC-2,  which  does 
not  permit  a  wire  to  connect  to  lossy 
ground. 

This  paper  briefly  overviews  the 
subject  of  elevated  radials,  and  presents  a 
few  case  studies  for  different  ground 
plane  type  antennas. 

Vertical  Monopoles 

Ground  Mounted  Radial  Systems 

NEC-2  is  currently  enjoying  wide 
application  by  amateurs  in  radio,  using  PC 
based  programs  such  as  EZNEC,  available 
from  Roy  Lewallen,  W7EL;  and  NEC/Wires, 
available  from  Brian  Beasley,  K6STI.  The 
ability  of  the  NEC  codes  to  accurately  treat 
the  air-ground  interface  (based  on  the 
Sommerfeld-integral  formulation)  is 
assumed  to  be  validated  by  the  developers 
of  the  program  [Burke  and  Miller,  1989]. 

As  mentioned  at  the  outset,  the  most 
comprehensive  set  of  experimental  data  is 
that  obtained  by  Brown  and  his  colleagues, 
who  measured  impedance  and  efficiency 
for  several  antenna  heights,  employing 
buried  radial  ground  systems,  0.411,  0.274 
and  0.137  X  long.  For  each  length  of  radial 
system  2,  15,  30,  60,  and  113  radial  wires 
were  used.  While  NEC-2  is  not  suited  to 
model  these  experiments,  we  have  used 
this  code,  by  employing  ground  mounted 
radials  at  a  height  of  6.25  x  lO"^  X  (5  mm  at 
3.75  MHz),  and  have  achieved  a  rather 
close  agreement  between  antenna 
resistances  predicted  and  measured. 

For  the  case  studies  to  be  reported 
here,  we  chose,  initially,  to  model  a 
3.75  MHz  monopole  over  a  radial  wire 
system  where  all  elements  were  #  10 

copper  wire,  and  all  wires,  monopole 


3 


height  and  radials  were  20  m  long  (0.25  X 
long  for  a  wavelength  of  80  m). 

Tom  Rauch,  W8JI  [private  comm¬ 
unications,  1995]  conducted  a  series  of 
experiments  with  a  3.7  MHz  quarter 
wavelength  vertical  wire  antenna  using 
elevated  and  ground  mounted  radials.  His 
measured  relative  field  strengths  for  the 
ground  mounted  radial  systems  change,  as 
number  of  radials  is  changed,  in  a  way 
closely  like  that  predicted  by  NEC-2.  For  4, 
8  and  16  radials  on  the  ground,  his 
measured  gain  differences  referenced  to 
the  relative  field  strength  measured  for  60 
radials  on  the  ground,  were  -5.5  dB,  -2.7  dB 
and  -1.3  dB.  For  an  insulated  radial  system, 
height  of  5  mm,  with  4,  8,  16,  32  and  64 
quarter-wavelength  radials,  EZNEC  pre¬ 
dicts  gains  for  an  antenna  over  average 
ground  (a  =  3  mS/m;  e  =  13)  of  -4.95  dBi, 
-2.1  dBi,  -0.43  dBi,  +0.21  dBi  and  +0.24  dBi. 

Elevated  Radials 

All  experiments  that  the  author  is 
aware  of  have  used  radials  of  a  fixed 
length,  X/4  long  (physical  length)  as  the 
height  of  the  radial  system  was  changed. 
But  a  radial  wire  at  a  low  height  over 
finitely  conducting  ground  needs  to 
increasingly  shortened  as  its  height  is 
lowered,  in  order  to  realize  and  maintain 
resonance.  The  length  for  the  ground 
mounted  radial  case  study  described  above 
was  for  a  fixed  length  radial,  X/4  long. 
The  resonant  length  of  a  radial  wire  5  mm 
above  average  ground  is  0.138  and  so 
clearly  the  radials  were  not  resonant.  The 
impedance  at  3.75  MHz,  according  to  NEC-2, 
of  our  20  m  high  vertical  wire  antenna 
with  4-radials  20  m  long  is  Za  =  140  +  j  117 
ohms  (which  is  certainly  not  a  resonant 
monopole),  and  the  gain  -4.95  dBi.  But,  if 
4-resonant  radials  11.1  m  long  (X/ 4 
electrical  length)  are  used,  the  antenna  is 
more  or  less  resonant,  Za  =  53  -  j  2  ohms, 
and  the  gain  -  1.4  dBi. 

The  reason  for  this  can  be  visually 
seen  in  Fig.  2,  where  we  show  the  wire 
models  for  these  two  antenna  systems,  and 


the  current  distribution  on  the  wires. 
Since  a  ground  system  is  one  side  of  a  GP 
antenna,  its  purpose  is  to  provide  a  low 
impedance  against  which  the  antenna  can 
be  driven.  Ideally  this  impedance  should 
have  a  low  resistance,  and  only  a  small 
reactance,  since  we  want  to  maximize  the 
current  on  the  antenna.  While  the 
ground  plane  antenna  can  be  re¬ 
dimensioned  to  re-establish  resonance, 
this  will  change  the  current  distribution 
on  the  radiator.  Clearly  the  longer  radials 
(Fig.  2a)  do  not  well  fulfill  these 
requirements.  Since  the  radials  are  not 
resonant  the  terminal  impedance  of  the 
monopole  is  changed,  and,  since  the 
current  on  the  source  end  of  the  radials  is 
not  the  maximum  current  on  the  radial 
wires,  the  radial  system  couples  more 
current  and  hence  more  power  loss  to  the 
ground  beneath,  then  does  the  shorter 
resonant  radial  system  (Fig.  2b). 

In  Fig.  3  we  have  plotted  the 
predicted  gain  for  a  20  m  high  vertical 
wire  antenna  vs.  height  in  wavelengths 
for  4-radials  a  quarter  wavelength  long 
(20  m);  for  radials  of  resonant  length 
(length  is  a  function  of  height  and  ground 
conductivity,  which  for  this  graph  is 
average  ground);  and  the  gain  of  a  half 
wave  dipole,  for  reference,  where  height 
is  the  lower  end  height  of  the  dipole. 

There  are  only  a  few  measured 
results  to  compare  with  predicted 
performance.  Christman  [1990]  measured 
groundwave  field  strengths  at  three 
distances  for  an  8  MHz  vertical  monopole 
with  various  ground  systems.  He  made 
measurements  with  four  X/4  long  elevated 
radials  for  three  heights,  1  m,  3  m  and 
5  m,  using  direct  and  isolated  feed, 
compared  with  120  ground  mounted 
radials.  In  his  experiment  he  recorded  18 
field  strength  values,  but  he  had  to  reject 
a  block  of  4  values  since  the  results 
(impedance  and  field  strength)  were  quite 
inconsistent  with  the  full  set  of 
measurements.  His  measurements,  in  our 
view,  illustrate  the  difficulty  experienced 
in  accurately  measuring  at  a  few  sites  on 
different  days  small  differences  in  gain 
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between  different  antenna  systems.  For 
example,  his  set  of  measured  field 
strengths  at  three  sites  for  a  quarter 
wavelength  monopole  with  4-radials  at 
1  m  and  3  m,  direct  feed,  compared  with 
120  radials  on  the  ground,  correspond  to 
relative  gain  differences,  of  -4,  -2,  -1,  0,  0, 
and  +1  dB,  for  a  median  difference 
between  0  and  -1  dB, 

Beverage  [1995]  determined  the 
efficiency  of  an  experimental  0,17X.  tower 
with  six  0.25X  radials,  0.024X  high,  at  the 
operating  frequency  1580  kHz,  by 
measuring  field  intensity  along  12  radial 
directions  extending  out  to  a  distance  of  up 
to  85  km.  The  measured  RMS  efficiency 
was  287  mV/m  for  1  kW  radiated  at  1  km, 
which  is  the  same  measured  value  as 
would  be  expected  (FCC  files)  for  a  0,17A, 
tower  above  120  buried  radials. 

In  spile  of  this  good  agreement 
between  theory  and  experiment,  there  are 
doubts  based  on  measurements, 
concerning  the  realizable  performance  of 
elevated  radial  systems.  For  example, 
Rauch's  elevated  radial  measurements  do 
not  agree  at  all  with  prediction.  He 
measured  a  trend  in  change  of  relative 
gain  with  change  in  number  of  radials 
(for  a  radial  height  of  2.44  m  or  0.03X  at 
3.7  MHz)  that  is  almost  identical  with  that 
for  radials  on  the  ground.  For  4,  8  and  16 
elevated  radials,  referenced  to  60  radials 
on  the  ground,  Rauch  measured  relative 
gains  of  -4.17  dB,  -2.28  dB  and  -1.08  dB. 
For  4,  8,  16  and  32  0.25  X  elevated  radials, 
EZNEC  predicts  gains  of  -0.02  dBi,  +0.08  dBi, 
+0.13  dBi  and  +0.16  dBi. 

Further  experimental  work 
apparently  has  to  done  to  resolve  these 
apparent  differences. 

Direct  Feed  vs.  Isolated.  Feed 

There  is  no  doubt  that  a  coax  feeder 
supplying  power  to  a  monopole  with 
elevated  radials  can  have  currents 
induced  to  flow  on  the  outer  surface  of  the 
shield,  if  the  antenna  is  directly  fed. 


Beverage  [1995]  noted  that  if  he 
disconnected  the  feedline  from  the 
antenna  tuning  unit  input  to  an  1160  kHz 
ND  tower  with  elevated  radials  (radial 
height  0.018  X),  and  installed  an  RF  choke 
made  up  of  toroidal  cores  around  the 

coaxial  cable,  that  the  antenna's 
impedance  changed  by  a  small  amount, 
and  a  slight  heating  of  the  cores  indicated 
that  an  RF  current  path  did  exist  along  the 
shield  of  the  coaxial  cable.  When  the 

antenna  was  retuned,  however,  no  change 
in  field  strength  could  be  detected. 

For  our  model  with  a  radial  height 
at  0.0125  X  (1  m)  the  shield  current  is 

insignificant.  We  simulated  a  coaxial 
cable  feeder  by  dropping  a  wire  connected 
to  the  radial  system  vertically  down,  to  a 
height  of  5  mm,  and  then  running  it 

horizontally  (beneath  a  radial  for 
maximum  coupling)  for  19  and  39  m,  to 
simulate  a  quarter  and  a  half  wavelength 
feeder.  The  change  of  impedance  and 
gain  was  very  small.  The  current  on  the 
quarter  wavelength  feeder  was  0.075 
amperes,  for  a  1  ampere  source  current; 
0.08  amperes  for  a  half  wavelength  feeder. 
"Now  mind  you"  our  simulated  feeder  was 
not  connected  to  ground  at  the 
'transmitter  end'.  However,  with  such 
close  coupling  to  ground  the  current  on 
the  wire  would  quickly  decrease  to  zero, 
and  in  the  case  of  an  actual  real  antenna, 
the  coaxial  cable  could  be  buried.  We 
simulated  the  effect  of  a  feeder  for  Rauch, 
WSJI's  experimental  antenna,  since  his 
radials  were  at  a  higher  height,  0.03  X  (2.4 
m),  to  see  if  we  could  explain  the  results 
he  obtained  (previously  discussed)  with 
elevated  radials  -  but  again  the  effects 
were  small. 

Phased  Array  with  Elevated  Radials 

Ground  systems  for  directive  arrays 
have  traditionally  employed  non-over¬ 
lapping  ground  systems  of  120  buried 
radials,  i.e.  the  ground  radials  are 
terminated  at  the  intersections.  With 
elevated  radials  current  flowing  on  one 
radial  wire  can  couple  with  current 
flowing  on  an  adjacent  radial  wire 
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associated  with  the  other  element  of  the 
array,  see  Fig.  4,  which  is  a  directive 
array  comprising  two  X/4  radiators  X/  4 
apart.  Four  resonant  radials  were  used, 
0.2333  X  long,  which  corresponds  to  a 
radial  wire  height  0.0125X  (1  m  at  3.75 
MHz)  over  average  ground.  If  we  examine 
the  patterns  (not  shown  here)  for  equal 
currents,  having  phase  differences  of  0°, 
90°  and  180°,  EZNEC  predicts  rather  ideal 
patterns,  which  change  from  broadside,  to 
cardioid,  to  an  in-line  figure-of-eight 
pattern;  horizontally  polarized  component 
less  than  -28  dB  below  maximum  gain. 

It  is  interesting  to  examine  the 
currents  on  the  radials  as  the  phase 
difference  in  the  feed  currents  is 
changed.  In  Table  1  we  have  tabulated 
the  currents  on  the  radial  wires 
(maximum  current  on  the  segment  close  to 
the  junction,  for  a  source  currents  of 
1  ampere).  Notice  how  these  currents 
change  as  the  phase  difference  between 
the  source  currents  is  changed.  Whether 
this  is  important,  or  an  interesting  detail 
is  not  known.  Certainly  with  a  buried 
radial  ground  system  no  one  worried  about 
differences  that  exit  between  currents  on 
the  different  parts  of  the  radial  wire 
ground  system. 

Ground  Plane  Type  Loops 

The  author  has  for  some  fifteen 
years  used  various  types  of  ground  plane 
type  half-loops.  The  loop  can  have 
various  shapes,  a  half-quad,  a  half-delta  or 
a  half-diamond.  The  perimeter  of  the  loop 
is  an  electrical  half  wavelength.  The  loop 
is  grounded  at  one  end  and  fed  against 
ground  at  the  other  end. 

To  model  GP  type  half  loop  antennas 
using  NEC-2  we  have  used  elevated  radials, 
see  Fig.  5,  which  is  a  sketch  for  a  half 
diamond  loop  with  two  sets  of  3  elevated 
resonant  radials  (as  above  0.2333  X  long). 
Notice  that  we  have  also  added  a  wire 
connecting  the  lower  ends  of  the  sloping 
sides.  The  side  length  for  a  half-diamond 
loop  is  an  electrical  X/4  wavelength 


(0,2627  X  for  our  loop).  The  current  is  a 
maximum  at  the  fed  and  ground  ends  of 
the  loop,  and  if  these  currents  are  exactly 
equal  and  in  phase,  there  should  be  little 
or  no  current  on  the  wire  connecting  the 
lower  ends  of  the  loop.  Also,  whatever 
current  there  is,  the  phase  difference 
between  current  on  one  end  and  the  other 
end  of  this  wire  should  be  180°.  For  our 
loop,  according  to  NEC-2,  the  current  on 
each  end  of  this  wire  is  small,  0.12  amps 
(0.02  amps  for  a  half-quad  with  elevated 
radials),  compared  to  the  source  current  of 
1  amp;  and  the  phase  difference  between 
current  on  the  ends  of  this  wire  is  179°. 

The  predicted  gain  for  this  antenna 
over  average  ground,  frequency  3.75  MHz 
is  -  0.69  dBi,  and  the  impedance  (a 

resonant  antenna)  is  85  ohms.  Hence 
resonant  radials  do  indeed  simulate  an 
effective  'ground  connection',  since  the 
loop  behaves  like  a  GP  type  resonant  half¬ 
loop.  Figure  5  also  shows  the  current 
distribution  on  the  wires.  As  we  have 
noted  we  would  not  expect  any  current  on 
the  wire  connecting  the  ground  ends  of 
the  loop  to  contribute  much  to  the  radiated 
field.  We  could  therefore  eliminate  it,  see 
Fig.  6a.  When  we  do,  there  is  only  a  small 
change  in  impedance  (79  -  j  2  ohms)  and  a 
small  change  in  gain  (-  0.43  dBi).  While 
the  antenna  now  may  no  longler  look  like 
a  half-loop,  it  has  the  characteristics  of  a 
GP  loop.  Continuing,  since  the  antenna 
radiates  dominantly  in  the  broadside 
directions,  perhaps  we  only  need  radials 
in  the  broadside  directions,  see  Fig.  6b. 
Again  there  is  a  change  in  impedance  (71 
-  j  13  ohms)  and  a  change  in  gain  (0.38 
dBi).  We  show  in  Fig.  7  the  radiation 
patterns  for  Fig.  5,  6a,  and  6b's  antennas. 

Finally,  we  can  perhaps  eliminate 
the  radials  on  one  of  the  broadside 
directions,  see  Fig.  6c,  to  realize  a 
directive  antenna  system.  The  antenna 
impedance  is  changed  (110  +  j  15  ohms) 
and  the  gain  is  changed  (0.51  dBi  at  a 
launch  angle  of  31°  in  the  direction  of 
maximum  gain).  In  Fig.  8  we  show  the 
radiation  patterns  for  Fig.  6c's  antenna. 
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Notice  that  here  we  have  a  unidirectional 
antenna  system. 

The  author  has  recently  learned 
that  one  of  his  colleagues  is  using  an 
antenna  system  like  that  sketched  in 

Fig.  6c.  Eldridge,  VE7BS  erected  a  half 
diamond  loop  for  160M.  Since  he  had  an 
inadequate  ground  system  (rocky  ground), 
he  decided  to  raise  the  loop,  cut  the  base 
wire  in  the  middle,  and  swing  the  two  ends 
of  the  wire  at  right  angles  to  the  plane  of 
the  loop.  The  apex  is  at  33.5  m  and  the 
wires  sloping  down  from  the  apex  are 
each  40.8  m  long.  The  ends  of  the  sloping 
wires  are  4.6  m  above  the  ground.  The 

antenna  is  fed  by  coax,  with  a  choke 
balun.  The  center  conductor  of  the  coax  is 
connected  to  the  sloping  wire,  the  shield 

to  the  39.6  m  wire  running  at  right  angles 
to  the  loop  (which  is  now  a  radial  wire). 
Both  radial  wires  are  4.6  m  above  ground 

where  they  connect  with  the  sloping 
wires,  and  these  radial  wires  are  also 
sloped,  end  height  1.5  m.  The  antenna  was 
resonant  at  1.84  MHz  (VSWR  about  2:1). 

According  to  EZNEC  the  resonate 
frequency  is  1.86  MHz,  impedance 
105  ohms,  gain  1.4  dBi,  take-off  angle  27®, 
front/back  ratio  4.2  dB. 

Lifting  the  'half-loop'  off  the 
ground  apparently  made  a  quieter  receive 
antenna,  the  resonance  was  easier  to 

establish  and  to  maintain,  and  the  gain 

definitely  favored  the  direction  toward 
which  the  radials  run.  VE7BS  noted  that 

amateur  stations  in  the  favored  direction, 
Japan,  were  always  better  received  than 
on  a  horizontal  loop,  but  stations  in  the 
opposite  direction  (southern  and 

southeastern  USA)  were  usually  worse. 

An  Inverted-L  antenna  with 
Elevated  Radials 

In  Fig.  9a  we  show  an  SOM 

inverted-L  antenna  with  four  20  m  long 
radials.  The  vertical  part  of  the  antenna  is 
3  m  high,  and  the  length  of  the  horizontal 
arm  17  m.  We  considered  this  an 
interesting  case  to  model,  since  the 


horizontal  arm  of  the  antenna  can  be 
arranged  to  lie  directly  over  a  radial.  One 
would  expect  that  this  radial  would  carry 
the  most  current,  and  that  the  phase  of 
this  current  would  be  approximately  180® 
with  respect  to  the  source  current  (a 
transmission  line  effect).  EZNEC  tells  us 
that  this  is  exactly  the  situation.  The 
current  on  the  radial  wire  beneath  the 
horizontal  arm  of  the  inverted-L  is 
0.56  amps  (171®),  for  a  source  current  of 
1  amp  (0®).  EZNEC  also  tells  us  is  that  if  I 
increase  the  number  of  radials  to  32,  I  get 
almost  exactly  the  same  pattern,  see 
Fig.  9.  If  for  the  4-radiaI  case  we  orient 
the  horizontal  arm  of  the  antenna  in  a 
direction  which  is  the  bisector  of  the 
radial  directions,  the  current  induced  on 
the  radials  is  less,  and  the  gain  is 
significantly  less  (-2.24  dBi  compared  with 
-0.1  dBi).  Thus,  apparently,  if  one  wants  a 
sparse  radial  system  to  perform  like  a 
multiwire  radial  system,  one  should 
arrange  that  the  antenna  couples 
maximum  current  into  the  radial  system. 

Conclusions 

Elevated  radial  ground  systems  have 
many  applications:  1)  to  realize  achievable 
gain  for  a  GP  type  antenna  over  real 
ground  with  only  a  few  radials;  2)  to 
provide  versatility,  e.g.  control  of 
radiation  pattern;  and  3)  for  modeling 
purposes  one  can  simulate  a  "connection" 
to  ground.  For  the  case  studies  that  we 
have  described,  a  connection  to  ground 
was  necessary,  since  the  antenna  was  a 
ground  plane  type.  But,  for  purposes  of 
modeling,  a  connection  to  ground  is 
sometimes  wanted  to  simulate  the  effect  of 
support  structures,  or  other  grounded 
towers  in  the  vicinity  of  the  antenna 
system.  To  work  well,  and  to  not  change 
the  resonant  frequency  of  the  GP 
antenna,  the  radials  must  be  resonant. 
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Table  1 

Currents  on  radials  for  a  two-element  phased  array, 
see  Fig.  4. 


Wire 

Number 

Phase  difference  source  currents 
0-degrees  90-degrees  180-degrees 

1 

0.11/148° 

0.3/152° 

0.36/-175° 

2 

0.37/170° 

0.3/-162° 

0.16/-160° 

3 

0.28/-168° 

0.21/-174° 

0.25/170° 

4 

0.28/-168 

0.21/-174° 

0.25/170° 

6 

0.32/175° 

0.22/-103° 

0.2/15.6° 

7 

0.32/175° 

0.22/-104° 

0.2/15.6° 

8 

0.17/-172° 

0.22/104° 

0.31/-9.8° 

9 

0.17/172° 

0.28/-79° 

0.31/-9.8° 

Fig.  1  T-  antenna  over  a  counterpoise  earth 

[after  Admiralty  Handbook  of  Wireless 
Telegraphy.  1938]. 


8 


g 


Elevation  Plot 
Azinutii  Angle  =  0.0  Deg. 


REVIEW  OF 

CHARACTERISTICS  FOR  HF  DIPOLE  ANTENNAS  INCLUDING 
THE  CASES  WHERE  THE  DIPOLES  ARE  ABOVE  AND  PARALLEL 
TO  THE  SURFACE  OF  REAL-WORLD  GROUNDS 

by 

G.  M.  Royer 
ABSTRACT 

The  author  has  written  a  95  page  paper  [4]  which  contains  a  comprehensive  study  of  the  characteristics 
for  half-wave  resonant  HF  dipole  antennas,  for  the  cases  where  the  dipoles  are:  a)  in  free  space,  and  b)  above 
and  parallel  to  the  surface  of  a  ground.  The  types  of  ground  considered  include  those  which  are  perfectly 
conducting  and  real-world;  where  real-world  ground  classifications  range  over  what  will  be  called,  very  dry, 
medium  dry,  and  wet.  The  long  length  of  [4]  precludes  its  comprehensive  presentation  here;  the  author  will 
therefore  make  available,  at  the  conference,  a  number  of  copies  of  [4]  for  distribution  among  those  who  are 
interested  in  its  contents. 

1.  INTRODUCTION 

The  computations  used  to  obtain  the  characteristics  contained  in  both  this  document  and  [4]  were  done 
with  the  aid  of  method  of  moments  program  NEC-3  (version  3  of  Numerical  Electromagnetic  Code)  [1].  Note 
that,  although  the  author  employed  NEC-3,  NEC-2  would  have  served  as  well. 

Listed  below  are  the  half-wave  resonant  dipole  chararacteristics  which  are  investigated  in  [4]  (see  the 
next  section  for  their  definition):  a)  normalized  resonant  dipole  length,  b)  input  (feed  gap)  resistance,  c) 
SWR  =  2  bandwidth,  d)  zenith  gain,  and  e)  gain  patterns.  Due  to  length  limitations  on  this  proceedings 
paper,  very  little  consideration  will  be  given  here  to  the  chracteristics  listed  above  under  a)  and  e). 

2.  DEFINITIONS  FOR  VARIABLES 

Figure  1  illustrates  variables  associated  with  the  dipole’s:  a)  dimensions,  b)  location  with  respect  to  the 
to  surface  of  the  ground  (the  XY  plane),  and  c)  orientation  in  the  spherical  (r,  coordinate  system.The 
dimensions  I,  g,  d  and  h  will  respectively  be  referred  to  as  dipole:  length,  feed  gap  width,  wire  diameter, 
and  height  above  the  ground.  The  frequency  for  the  signal  fed  to  the  feed  gap  will  be  symbolized  by  /  (Hz) 
(or  w  (radians/sec.)  where  w  =  27r/).  The  signal’s  free  space  wavelength  is  A  where  X  =  c/f  and  c  is  the 
speed  of  signal  propagation  in  free  space  (c  ~  299.8  x  10®  m/sec.).  The  dipole’s  input  (feed  gap)  impedance 
is  expressed  as  .^a  =  -f  jXa  in  ohms(G) 

The  half-wave  resonant  condition  for  the  dipole  occurs  when  Xa  =  0  and  I  is  approximately  equal  to, 
but  less  than,  A/2.  When  the  dipole  is  half-wave  resonant  let  us  symbolize  /,  A  and  Za  by  respectively  fo, 
A<,  and  Z^o  =  R-ao  +  jO.  For  the  case  where  the  dipoles  are  half-wave  resonant,  the  table  and  graphs  (in 
both  this  document  and  [4])  will  employ  normalized  representations  for  /,  g,  d  and  h;  in  particular  we  will 
be  using:  (V(V2))^>  [9/^0,  {dlX)o  and  h/Xo-  .  . 

Let  SWR  represent  the  standing-wave  ratio  which  would  exist  on  a  lossless  transmission  line,  connected 
between  a  signal  generator  and  the  dipole’s  feed  gap,  where  the  transmission  line’s  characteristic  impedance 
is  made  to  be  the  dipole’s  resonant  feed  gap  resistance  [Rao)-  Subistitution  of  values  for  R^o  and  Za  as 
a  function  of  f/fo  into  eqns.  (2)  and  (3)  of  [4],  make  it  possible  to  compute  SWR  as  a  function  of  ///o- 
Figure  2  illustrates  Za  (and  the  resulting  as  functions  ///<,,  for  the  case  where  the  dipole  is  in  free 

space  and  (d/A)^  =  140  x  10“®.  Let  /_  and  /+  be  the  first  frequencies  encountered,  as  /  is  respectively 
decreased  and  increased  from  fo,  where  SWR  =  2.  The  dipole’s  SWR  =  2  bandwiwth  is  defined  to  be 
bw  =  ((/+  -  /_)//£,)  100  %.  The  above  SWR  -  2  type  of  bandwidth  is  mentioned  (for  example)  on  page 
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17-2  of  [2].  Its  use  is  justified  by  the  observation  that  it  becomes  difficult  to  match  a  transmitter  to  a 
transmission  line,  at  those  frequencies  where  the  SWR  on  the  line  is  greater  than  2. 

We  will  symbolize  a  dipole’s  B  polarized  gain,  in  the  direction  [6,  <f)),  by:  a)  Ge{6,  in  general,  and  b) 
Geo{0,<i>),  when  the  dipole  is  resonant.  The  above  gains  are  said  to  be  6  polarized  because  (see  eqns.  (7) 
and  (8)  in  [4])  they  are  associated  with  the  component  of  the  far  electric  field  intensity  (at  (r,  0,  <^))  which 
is  polarized  parallel  to  the  9  unit  vector.  and  are  defined  like  the  gains  under  a)  and  b) 

above,  except  that  they  are  associated  with  the  component  of  the  far  electric  field  intensity  which  is  polarized 
parallel  to  the  <f)  unit  vector. 

We  are  using  complex  notation  where  exp{juit)  is  understood;  therefore  the  ground’s  relative  permittivity 
can  be  expressed  as  €rg  =  4^  “  J^rg  ^^ere  e’^g  and  are  respectively  the  real  and  imaginary  parts  of  the 
ground’s  relative  permittivity.  At  HF  frequencies,  it  is  traditional  to  specify  ground  conductivity  instead  of 
e'^g.  These  two  variables  are  related  by  where;  a)  (Tg  in  (mhos/m)  or  S/m  (siemens/m), 

is  the  ground’s  conductivity,  and  b)  Co  is  the  permittivity  for  free  space  (fo  ~  8.854  x  sec./(nm)). 

3.  DIPOLE  CHARACTERISTICS 

For  the  case  where  the  dipole  is  in  free  space,  we  will  be  content  here  with  the  presentation  of  the 
characteristics  shown  in  Table  1.  Reference  [4]  contains,  in  addition,  figures  which  present,  as  functions  of 
{d/X)c  (for  16  X  10"®  <  id/X)o  <  520  x  10"®),  the  following  free  space  dipole  properties:  ((//(A/2))^,  Rao, 
and  SWR  =  2  frequency  band  properties.  Note  that  the  foregoing  range  for  {d/X)o  is  sufficient  to  include 
the  diameters  of  AWG  nos.  4  to  14  for  frequencies  across  the  HF  band  (i.e.  3  MHz  to  30  MHz). 

Table  1,  Characteristics  for  Fig.  I’s  free  space,  half-wave  resonant  dipole, 
as  a  function  of  the  {d/X)o  values  shown  at  the  top  of  the  table. 


{d/X)o 

16  X  10"® 

140  X  10"® 

520  X  10-‘ 

0/(A/2)), 

.9764169 

.9692884 

.9621924 

Rao  {^) 

69.7558 

68.4934 

67.0544 

f-/fo 

.9732212 

.9660110 

.9593826 

U/fo 

1.0291353 

1.0379030 

1.0463694 

if-~fo)/fo 

-.0267788 

-.0339890 

-.0406174 

U-fo)/fo 

.0291353 

.0379030 

.0463694 

iU  -  f-)/fo 

.0559142 

.0718920 

.0869868 

G«.(0,0)  (dB) 

2.28932 

2.35642 

2.44066 

Figures  3  and  4  plot  Rao  and  bw  for  the  case  where  Fig.  I’s  dipole  is  above  and  very  close  to  a  perfectly 
conducting  ground.  It  is  shown  that,  when  h/Xo  is  small  and  decreasing,  both  Rao  and  bw  are  small  and 
decreasing.  Rao  behaves  this  way  because,  for  a  given  feed  gap  current,  as  h/Xg  decreases  toward  a  small 
value,  the  far  radiated  fields  from  the  dipole  and  its  image  cancel  more  and  more  completely;  resulting  in  a 
decreasing  radiated  power.  The  dipole’s  bw  has  the  above  discussed  property  because  Rao  has  this  property 
whereas  the  rate  of  change  of  Aa ,  as  a  function  of  /,  does  not.  In  contrast  to  the  behavior  of  Rao  and  bw,  Fig. 
8  in  [4]  shows  that  when  our  dipole’s  normalized  height  (h/Xo)  above  a  perfectly  conducting  ground  is  small 
and  decreasing,  (l/(A/2))^  is  increasing  and  approaching  1.  This  is  the  behavior  is  what  one  would  expect 
because  as  h/X^  approaches  0,  the  dipole  (with  its  image)  will  behave  more  and  more  like  two  back-to-back, 
non-radiating,  quarter-wave  resonant,  transmission  lines. 

The  reader  is  referred  to  Section  4.2  of  [4]  for  a  study  of  the  characteristics  of  Fig.  I’s  dipole,  above 
a  perfectly  conducting  ground,  where  .1  <  h/Xo  <  1-6-  Included  in  this  study  are:  a)  (//(A/2))^,  Rao  and 
bw,  as  functions  of  h/Xo,  for  the  cases  where  {d/X)o  are  16  x  10~®,  140  x  10"®  and  520  x  10"®,  b)  Ge<,(0)0) 
(zenith  gain),  as  a  function  of  h/Xo,  where  {d/X)o  is  140  x  10"®.  Note  that  gain  is  not  a  sensitive  function 
of  {d/X)o,  and  c)  a  selection  of  vertical  gain  patterns  (Gfl(,(^,  0))  and  G0<,(0, 90'))  for  selected  k/Xo  values 
and  where  (d/A)o  =  140  x  10"®. 

Section  5  of  [4]  contains  an  examination  of  resonant  characteristics  for  Fig.  I’s  dipole  where  the  ground 
types  are  real-world.  Listed  below  are  the  real-world  ground  names,  with  their  associated  dielectric  properties 
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(both  taken  from  [3]),  which  we  will  be  using. 

a)  Very  dry  ground,  e'^g  =  3,  =  .0001  fi”  V™- 

b)  Medium  dry  ground,  =  15,  (Tg  =  .001 

c)  Wet  ground,  =  30,  cr^  =  .01 

In  Section  5  of  [4]  it  was  found  necessary  to  add  two  independent  variables  (in  particular,  frequency 
and  ground  type)  to  those  employed  for  the  case  where  the  ground  was  perfectly  conducting.  Note  that 
frequency  had  to  be  added  because  (as  [3]  shows),  while  (7g  is  essentially  constant  as  a  function  of  /,  for 
/  <  30  MHz,  it  follows  that  (refer  to  the  previously  given  function  relating  e' Cg  and  w)  is  not.  In  the 
interest  of  putting  a  reasonable  limit  on  the  number  of  graphs  in  the  above  Section  5,  the  author  decided  to 
present  dipole  characteristics  for  only  one  wire  diameter.  The  wire  diameter  chosen  was  that  for  AWG  no. 
12  (.205  cm).  This  is  the  guage  recommended  on  page  33-1  of  [2],  for  long  horizontal  dipole  wires  which  are 
expected  to  support  their  own  weight,  plus  that  of  their  feed  lines. 

Subsections  5.1  and  5.2  in  [4]  examine  the  cases  where  the  normalized  height  for  Fig.  I’s  dipole,  above 
real-world  grounds,  is  respectively  in  the  following  ranges:  .003  <  h/Xg  <  .1  (when  the  dipole  said  to  be 
close  to  the  ground)  and  .1  <  <  1.6.  Both  sections  contain  figures  which  plot  the  following  dipole 

characteristics  as  functions  h/Xg:  (//(A/2))^,  Rao,  bw,  and  Geo(0,0).  There  are  four  plots  for  each  of  the 
above  functions;  one  for  each  of  the  following  values  for  fo-  3  MHz,  6  MHz,  10  MHz,  and  30  MHz.  Each 
of  the  above  figures  has  at  least  three  curves;  one  for  each  of  the  real-world  ground  types.  Some  of  the 
graphs,  when  /©  =  6  MHz,  have  an  additional  curve  for  the  case  where  the  ground  is  perfectly  conducting. 
In  addition  to  the  above  figures:  a)  Section  5.1  has  a  graph  showing  the  vertical  patterns  Gflo(^,0)  and 
G^o(5,90°),  for  the  case  where  /o  =  6  MHz  and  h/Xo  =  .003,  and  b)  Section  5.2  has  a  number  of  plots 
showing  the  above  vertical  gain  patterns,  at  fg  —  &  MHz,  for:  each  of  the  real-world  ground  types,  and 
selected  values  of  h/Xo- 

This  document’s  characteristics  for  Fig. I’s  dipole  when  above  real-world  grounds,  are  contained  in  Figs. 
5  to  10.  They  plot  Rao,  bw  and  Geo{0, 0),  as  functions  of  h/Xo,  where  /o  =  6  MHz.  Figures  5  to  7  and  8  to  10 
present  characteristics  for  respectively  the  cases  where  h/Xo  satisfies:  .003  <  h/Xo  <  -1  and  .1  <  h/Xg  <  1.6 
When  the  dipole  is  close  to  the  ground,  it  is  shown  that  losses  in  the  real-world  ground  make  the  dipole’s 
characteristics  behave  much  differently  than  they  do  for  the  case  where  the  ground  is  perfectly  conducting. 
In  particular,  when  h/Xo  is  decreased  toward  a  small  value,  it  is  shown  that: 

a)  when  the  ground  is  perfectly  conducting:  1)  (//(A/2))^  increases  toward  1,  2)  both  Rao  and  bw  decrease 
toward  a  small  value,  and  3)  G«o(0,0)  increases  toward  a  peak  value  of  about  9.2  dB,  whereas 

b)  when  the  ground  is  real-world:  1)  (f/(A/2))  decreases  to  values  between  .82  and  .91  at  hXg  =  .003 
(depending  on  the  frequency  and  ground  type),  2)  both  Rao  and  bw  increase  to  above  free  space  values,  and 
3)  Gso(0, 0)  decreases  to  values  between  -2  dB  and  -12  dB  at  h/Xg  ~  .003. 

When  the  dipole’s  normalized  height  above  real-world  grounds  is  in  the  range  .1  <  h/Xg  <  1.6,  it  is 
shown  that  (l/(X/2))  ,  Rao,  bw  and  Geo(0,0),  as  functions  of  h/Xg,  vary  about  their  free  space  values.  As 
concerns  the  above  variations,  it  is  shown  that  their;  a)  period  is  about  .5,  and  b)  amplitude  increases  as  the 
ground’s  moisture  content  increases.  The  period  of  .5  for  the  dipole’s  feed  gap  characteristics  (i.e.  (// (A/2))^, 
Rao  and  felt;)  is  explained  when  it  is  noted  that  the  field  from  the  dipole,  to  the  ground,  and  then  back  to 
the  dipole,  changes  phase  by  about  360’  when  h/Xo  changes  through  .5.  The  far  field  characteristic  Gffo(0, 0) 
has  a  period  of  .5  because  the  zenith  fields  from  the  dipole  and  the  ground,  change  in  phase  by  about  360° 
when  h/Xo  changes  through  .5.  The  above  mentioned  increase  in  the  amplitude  of  variation  in  the  dipole’s 
characteristics,  is  a  result  of  the  fact  that  the  ground’s  reflectivity,  and  hence  the  strength  of  the  ground 
reflected  signal,  increases  as  soil  moisture  is  increased. 

4.  CONCLUSIONS 

The  ground  and  its  type  (dry,  wet,etc.)  is  shown  to  have  a  significant  effect  on  the  characteristics  of 
horizontal,  half-wave,  HF  dipole  antennas.  The  information  in  [4]  should  make  it  possible  for  the  antenna 
designer  to  estimate  the  magnitude  of  the  above  effects. 

An  example  of  the  use  of  the  information  [4],  is  for  the  design  of  a  system  which  employs  an  HF  dipole 
which  is  close,  and  parallel,  to  a  real-world  ground.  When  compared  with  horizontal  HF  dipoles  located  at 
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greater  heights  above  the  ground,  this  dipole  would  be  relatively  easy  to  transport  and  erect.  In  addition  it 
possesses  a:  a)  convenient  Rao  value,  and  b)  relatively  broad  bandwidth.  The  designer  of  a  system  which 
employs  this  dipole  should  however  take  into  consideration  that:  a)  its  resonant  length  changes  significantly 
as  a  function  of  groumd  type,  and  b)  it  has  low  gain  characteristics. 
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Fig.  1,  Shown  above,  for  the  case  where  the  dipole  is  above  and  parallel  to 
a  ground  with  a  plane  surface  (the  XY  plane),  are  symbols  associated  with 
the  dipole's:  a)  dimensions,  and  b)  location  in  the  (x.y.z)  andCr.fl,^)  coord¬ 
inate  systems.  The  ground  is  homogenous  and  located  in  the  region  z^O. 

The  symbols  d,  g.  /  and  h  will  be  referred  to  as  respectively  dipole;  wire 
diameter,  feed  gap  length,  length  and  height  obove  the  ground.  In  most  of 
this  paper,  the  above  variables  will  be  made  dimensionless  through  their 
normalization  wrt  the  signal’s  free  space  wavelength  (X).  In  particular  we 
will  be  using  d/X.  g/X.  //(X/2)  and  h/X  (or  when  the  dipole  is  resonant: 
(d/X)^.  (g/X)jj.  and  (//(X/2))q  and  h/X^).  For  all  cases,  we  will  moke: 
a)  (g/X)o=,0052.  and  b)  the  dipole  wires  to  be  perfectly  conducting.  For 
those  cases  where  the  ground  is  not  perfectly  conducting,  d  will  be  .205  cm 
(the  diameter  for  AWG  »I2  wire).  When  the  dipole  is  in  free  space:  h=0. 
end  there  is  no  ground. 
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Fig.  2a.  Zg  =  Rq+jXq  as  a  function  of  f/f^. 
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Fig.  2b.  SWR  as  a  function  of  f/fg. 


Fig.  2.  For  Fig.  Vs  dipole  In  free  space  (with  Cd/X)Q=140xlO"*).  input 
impedance  (Zg=Rg+jXg)  standing-wave  ratio  (SWR).  as  functions  of  normal¬ 
ized  frequency  (f/f^).  Note,  the  dipole  is  resonant  at  f=fQ  (where  Xo=0). 
Refer  to  the  text  for  a  definition  of  the  dipole's  standing-wave  ratio.  The 
normalized  frequencies  f-Zf^  and  f+Zfg  are  the  values  of  f/fg.  respect¬ 
ively  below  and  above  f/fQ=l.  where  SWR=2. 
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h/X„ 

Fig.  3,  For  Fig.  I's  dipole,  the  resonant  input  resistance  as  a  function  of 
normalized  height  above  the  ground  plane,  where:  a)  the  ground  is  perf¬ 
ectly  conducting,  b)  the  dipole  is  close  to  the  ground  (i.e.  .003^h/XQ^.100). 
and  c3  Cd/X)o=H0xl0~6. 


(%)  OOtC  }/(■ 


0.003  0.010  0.500 


h/X^ 

Fig.  4,  For  Fig.  I's  dipole,  the  SWR=2  normalized  bandwidth,  as  function  of 
normalized  height  above  the  ground  plane,  where:  a)  the  ground  is  perf¬ 
ectly  conducting,  b)  the  dipole  is  close  to  the  ground  (i.e.  .OOS^h/Xg^ 
.100).  and  c)  Cd/X)^  has  the  values  listed  in  the  legend. 
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h/X„ 

Fig.  5,  For  Fig.  I’s  dipole,  the  resonant  input  resistance,  as  a  function  of 
normalized  height  above  the  ground,  where:  a)  the  resonant  signal  freq¬ 
uency  is  6  MHz.  b)  there  is  a  curve  for  each  of  the  ground  types  indicated 
in  the  legend,  and  c)  the  dipole  is  close  to  the  ground  Ci.e.  .003^h/Xo^ 
.100). 
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h/X^ 

Fig.  6.  For  Fig.  I’s  dipole,  fhe  SWR=2  normalized  bandwidih,  as  a  function 
of  normalized  height  above  the  ground,  where:  a)  the  resonant  signal 
frequency  is  6  MHz,  b)  there  is  a  curve  for  each  of  the  ground  types 
indicated  in  the  legend,  and  c)  the  dipole  is  close  to  the  ground  Ci.e. 
.OOa^h/Xo^.lOO). 
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Fig.  7.  For  Fig.  I's  dipole,  •the  resonant  zenith  gain,  os  a  function  of 
normalized  height  above  the  ground,  where;  a)  the  resonant  signal  freq¬ 
uency  is  6  MHz.  b)  there  is  a  curve  for  each  type  of  ground  indicated  in 
the  legend,  and  b)  the  dipole  is  close  to  the  ground  (i.e.  .OOB^h/X^^-lOO). 
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Fig.  8.  For  Fig. Vs  dipole,  the  resonant  input  resistance,  as  a  function 
of  normalized  height  above  the  ground,  where:  a)  the  resonant  signal  freq¬ 
uency  is  6  MHz,  b)  there  Is  a  curve  for  each  type  of  ground  indicated  in 
the  legend  (note  that  there  is  a  curve  for  the  case  where  the  ground  Is 
perfectly  conducting),  and  c)  h/Aj,  ranges  between  .1  and  1.6.  Also  shown, 
for  comparison  purposes,  is  the  resonant  input  resistance  for  the  above 
dipoles  when  they  are  in  free  space. 
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h/X^ 


Fig.  9.  For  Fig. Vs  dipole,  the  SWR=2  normalized  bandwidth  <%).  as  a  funct¬ 
ion  of  normalized  height  above  the  ground,  where:  a)  the  resonant  signal 
frequency  is  6  MHz.  b)  there  is  a  curve  for  each  type  of  ground  indicated 
in  the  legend,  and  c)  h/X^  ranges  between  .1  and  1.6.  Also  shown,  for  com¬ 
parison  purposes,  is  the  SWR=2  normalized  bandwidth  (%)  for  the  above 
dipoles  when  they  are  in  free  space. 
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Fig.  10.  For  Fig.  I's  dipole,  the  resonant  zenith  gain,  as  a  function  of  norm¬ 
alized  height  above  the  ground,  where:  a)  the  resonant  signal  frequency 
is  6  MHz.  b)  there  is  a  curve  for  each  type  of  ground  indicated  in  the  leg¬ 
end,  and  c)  h/Xp  ranges  between  ,1  and  1.6.  Also  shown,  for  comparison 
purposes,  is  the  resonant  gain  (GqqCO.O),  CdB))  for  the  above  dipoles  when 
they  are  In  free  space 
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Introduction 

This  paper  is  a  continuation  of  material  presented  at  the  1995  ACES  Symposium  [1], 
describing  the  author’s  work  with  multi-frequency  antennas  which  use  the  technique  of  closely- 
spaced  coupled  resonators.  Three  practical  antennas  are  described,  which  operate  in  several  of 
the  HF  amateur  radio  bands. 

The  Coupled-Resonator  Principle 

The  coupled-resonator  (also  called  multiple-open-sleeve)  principle  is  a  useful  technique  for 
obtaining  multi-frequency  operation  in  dipole  and  monopole  antenna  elements.  It  is  derived 
from  previous  work,  the  coaxial  sleeve  antenna  and  the  open-sleeve  antenna.  The  coaxial  sleeve 
antenna  places  a  cylindrical  sleeve  around  a  driven  dipole  (or  monopole),  resulting  in  a  single 
structure  which  exhibits  two  resonances  —  that  of  the  driven  element,  and  of  the  sleeve.  In  the 
open-sleeve  antenna,  the  sleeve  is  reduced  to  two  parallel  conductors  which  create  a  skeleton 
representation  of  a  coaxial  sleeve.  Coaxial  sleeve  and  open-sleeve  techniques  have  been  used 
extensively  in  two-frequency  and  broadband  antennas. 

Because  the  original  work  in  this  area  involved  coaxial  or  open-sleeve  configurations,  most 
developmental  work  assumed  that  the  additional  resonators  were  required  to  follow  that  type  of 
construction.  Relatively  few  antenna  designers  understood  that  only  a  single  additional  conduc¬ 
tor  is  required  to  create  a  second  resonance.  This  simpler  topology  allows  individual  conductors 
to  be  placed  radially  around  the  driven  element,  creating  antennas  with  three,  four,  five,  and 
more  resonances.  The  upper  limit  to  the  number  of  frequencies  that  can  be  obtained  in  a  single 
structure  is  determined  by  unwanted  coupling  among  the  many  different  conductors. 

Using  MININEC  and  NEC  computer  modeling,  combined  with  several  experimental  anten¬ 
nas,  the  author  has  derived  an  equation  that  relates  the  conductor  size,  spacing,  and  the  feed- 
point  impedance.  The  equation  is  based  on  the  basic  relationship; 


log(d) 

log(D/4) 


=  0.54 


where, 

•  D  is  the  diameter  of  the  conductors  (assumed  to  be  equal)  in  wavelengths  at  the 
resonant  frequency  of  the  coupled  resonator. 

•  d  is  the  spacing  on  centers  between  the  conductors,  also  in  wavelengths  at 
the  resonant  frequency  of  the  coupled  resonator. 

•  The  system  impedance  is  that  of  a  dipole  in  free  space,  or  a  monopole  fed  against 
perfect  ground. 
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•  The  lengths  of  the  conductors  have  been  adjusted  for  a  non-reactive  impedance. 

•  A  two-frequency  antenna  is  assumed,  although  the  equation  is  a  reasonable 
approximation  for  each  frequency  in  a  larger  system. 

Correction  factors  have  been  derived  for  other  impedances,  and  for  the  special  case  where 
the  coupled  resonator  approaches  the  driven  element  resonant  frequency.  These  are  described  in 
[1]  and  [2],  and  they  are  combined  with  the  above  expression  into  a  single  equation: 
where, 


din  -  lo[0-54Iog(D/4)]  Zq  +35.5  ^-[(((Fn/Fi)-l.l)xll.3)+0.1] 


•  Zq  is  the  desired  impedance  (resistive,  between  25  and  125  ohms) 

•  Fjj  is  the  frequency  of  the  additional  resonator 

•  Fj  is  the  frequency  of  the  driven  element 

•  dj^jj  is  the  spacing  between  the  driven  element  and  the  desired  nth  resonator,  in 
wavelengths  at  F^^ 

•  D  is  the  diameter  of  the  conductors  (assumed  equal),  in  wavelengths  at  F^^ 

The  above  equation  describes  conditions  for  resistive  impedances.  The  length  of  the  ele¬ 
ments  can  be  adjusted  to  change  the  reactance.  It  is  easily  seen  that  this  technique  provides  a 
wide  range  of  control  over  the  feedpoint  impedance  at  a  given  frequency  —  by  controlling  the 
variables  of  conductor  diameter,  spacing  and  length.  This  flexibility  in  design  is  the  principal 
advantage  of  the  coupled  resonator  technique,  and  is  the  key  to  successful  operation  of  the 
antenna  described  in  Example  3  presented  below. 

Example  1: 10.1,  18.068,  24.89  MHz  Wire  Dipole 

The  first  example  is  a  three-frequency  antenna  built  with  #12  AWG  wire  conductors,  operat¬ 
ing  on  the  10.1,  18.068  and  24.89  MHz  amateur  bands.  Antenna  element  dimensions  are  given 
in  Figure  1.  A  free-space  comparison  of  radiation  patterns  on  the  three  bands  (modeled  in 
ELNEC)  is  shown  in  Figure  2.  The  radiation  patterns  at  10.1  and  18.1  MHz  are  indistinguish¬ 
able  from  that  of  an  ordinary  dipole.  At  24.9  MHz,  the  current  in  the  driven  element  is  signifi¬ 
cant,  and  when  summed  with  the  coupled-resonator  current,  results  in  a  gain  of  0.6  dB. 

Feedpoint  VSWR  in  a  50-ohm  system  is  comparable  to  MININEC-calculated  VSWR  at  the 
installed  height  of  50  feet  above  ground.  The  amateur  bands  covered  by  this  antenna  only 


46’  5” 

26’  0” 

.  . 

. zZ . J 

1  1 

L . 

I 

18’ 11 

1  \ 

insulating  spacers 

^  nylon  cord 

spacing  between  wires  1 

maintained  at  2  inches  o.c. 

1  \ 

Figure  1.  Dimensions  of  the  10.1,  18.068,  and  24.89  MHz  wire  dipole  of  Example  1. 
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require  0.4  to  0.55  percent  bandwidth, 
well  within  the  bandwith  provided  by 
this  antenna.  The  only  unexpected 
behavior  of  this  antenna  is  a  greater  sen¬ 
sitivity  to  height  above  ground  than  a 
simple  dipole.  This  behavior  had  not  been 
investigated  through  modeling,  and  it 
was  confirmed  later  that  modeling  also 
predicts  these  impedance  excursion. 

On-air  performance  of  this  antenna  is 
comparable  to  that  expected  for  a  dipole 
at  each  frequency. 

Example  2:  14,  21,  28  MHz  Aluminum 
Tubing  Dipole 

The  second  example  antenna  is  also  a 
three-frequency  antenna,  built  for  the  14, 
21  and  28  MHz  amateur  bands,  using 
aluminum  tubing  construction  (dimen¬ 
sions  in  Figure  3).  The  14  and  21  MHz 
elements  taper  from  1-1/4-inch  diameter 
tubing  to  5/8-inch  diameter,  and  the  28 
MHz  element  tapers  from  1-1/8-inch 
diameter  to  5/8-inch  diameter.  Spacing 
between  the  driven  14  MHz  dipole  and 
the  coupled-resonator  elements  was  fixed 
at  7  inches  on  centers. 

VSWR  bandwidth  exceeds  4  percent 
at  14  MHz,  and  is  greater  than  2.5  per¬ 
cent  at  both  21  and  28  MHz.  This  is  suffi¬ 
cient  to  cover  the  entire  amateur  band  at 
14  and  21  MHz,  and  covers  the  most 
actively-used  portion  of  the  28  MHz 
band.  Larger  conductors  were  intention¬ 
ally  chosen  for  these  wider  amateur 
bands.  Although  operating  on  different 
frequencies  than  the  previously- 
described  wire  antenna,  a  comparison 
shows  that  the  larger-conductor  antenna 
hss  more  than  three  times  the  percentage 
bandwidth  for  2:1  VSWR. 

The  larger  conductors  mitigate  the 
variations  in  impedance  versus  installed 
height  that  were  first  seen  with  the  wire 
antenna.  As  with  other  test  antennas, 
this  version  demonstrates  that  larger 
dimensions  result  in  a  less  critical 
design,  because  of  the  inherently  broader 
bandwidth  of  the  larger  diameter  conduc- 


(top);  18.1  MHz  (center);  24.9  MHz  (bottom). 
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Figure  3.  Dimensions  of  the  three-frequency  aluminum  tubing  dipole  of  Example  2. 


tors,  and  the  larger  spacing  distances  for  the  desired  feedpoint  impedance.  Based  on  modeling 
and  experiments,  the  use  of  very  small  conductors  (less  than  0.0001  ?i)  is  not  recommended. 

Example  3:  A  14, 18, 21,  24,  28  MHz  Vertical  Dipole 

The  third  example  is  a  more  complex  antenna,  a  vertical  dipole  operating  on  five  frequencies 
covering  the  14,  18.068,  21,  24.89  and  28  MHz  amateur  bands.  This  antenna  is  illustrated  in 
Figure  4.  The  driven  element  is  a  modified  vertical  dipole,  with  an  inverted  “T”  for  the  lower 
half  to  reduce  the  overall  height.  The  lowest  conductor  is  installed  at  four  feet  above  ground, 
and  the  vertical  portion  extends  to  30  feet  above  ground.  The  feedpoint  is  eight  feet  above  the 
horizontal  portion,  and  the  feedline  is  run  inside  this  lower  section  of  tubing.  Above  the  feed- 
point,  the  driven  element  tapers  from  1-1/4-inch  diameter  to  5/8-inch  diameter.  Each  of  the 
additional  resonators  is  3/8-inch  diameter  tubing,  mounted  on  fiberglass  insulating  spacers. 

A  vertical  dipole  installed  close  to  ground  has  a  feedpoint  impedance  in  the  range  of  110 
ohms.  The  inverted  “T”  configuration  lowers  this  to  about  90  ohms.  But,  operation  in  a  50-ohm 
system  is  desired,  requiring  some  type  of  matching.  A  matching  section  of  75-ohm  transmission 
line,  80  degrees  electrical  length  at  14  MHz,  was  selected.  To  obtain  a  50-ohm  feedpoint  imped¬ 
ance  through  this  transmission  line  section,  the  antenna  impedance  should  be: 

14  MHz—  107.3  +jl7.0  Q 

18.068  MHz—  108.2  -jl5.9  Q 

21  MHz—  88.8  -j30.3  Q 

24.89  MHz—  66.6  -j27.6  Q 

28  MHz—  55.4  -jl3.8  Q 

The  selected  line  section  achieves  an  acceptable  match  at  14  MHz.  Using  the  flexibility  of 
the  coupled-resonator  method,  we  can  obtain  the  desired  antenna  impedances  at  the  other  fre¬ 
quencies  by  controlling  the  size,  spacing  and  length  of  each  resonator.  This  is  a  classic  conjugate 
matching  problem,  but  with  the  source  and  load  reversed  from  the  typical  situation.  The  proce¬ 
dure  starts  by  using  the  design  equation  to  determine  the  required  spacing  for  the  resistive 
component  of  the  impedance.  The  length  of  the  resonator  is  then  iteratively  changed  to  create 
the  inductive  or  capacitive  reactance.  Further  iteration  may  be  needed  to  optimize  the  spacing, 
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Figure  4.  Diagram  of  the  five-frequency  vertical  dipole  antenna  described  in  Example  3. 


since  the  resistive  component  of  impedance  also  changes  with  the  length.  As  built,  the  finished 
antenna  operates  with  a  VSWR  1.2:1  or  better  at  the  five  center  frequencies.  2:1  VSWE  band- 
widths  are  as  follows: 


14  MHz  — 
18.068  MHz  — 
21  MHz  — 
24.89  MHz  — 
28  MHz  — 


>350  kHz  (full  band) 
>100  kHz  (full  band) 
-350  kHz 

>100  kHz  (full  band) 
-450  kHz 


It  is  interesting  to  note  that  the  bandwidth  at  28  MHz  is  somewhat  greater  than  expected.  This 
is  probably  due  to  overlap  with  the  adjacent-frequency  24.89  MHz  resonator. 

All  coupled-resonator  elements  include  an  adjustable  section  at  the  bottom  of  the  antenna.  It 
should  also  be  noted  that  the  resonators  are  not  symmetrical  with  respect  to  the  feedpoint.  This 
does  not  cause  a  significant  variation  in  the  required  spacing,  compared  to  a  symmetrical 
design.  Finally,  the  18.068  MHz  resonator  is  not  a  continuous,  straight  conductor;  2.5  feet  of  its 
total  length  is  bent  at  90  degrees,  to  maintain  the  desired  overall  height.  This  changes  the  cou¬ 
pling,  and  the  required  spacing  was  reduced  from  that  required  for  a  resonator  coupled  over  its 
entire  length. 

Experimentation  was  needed  to  optimize  the  spacing  for  the  24.89  and  28  MHz  resonators. 
MININEC  modeling  indicated  that  they  should  be  spaced  7  and  8  inches,  respectively,  from  the 
driven  element.  After  construction,  these  spacings  were  reduced  to  6  and  7  inches  for  best  per¬ 
formance.  The  proximity  to  ground  is  the  likely  cause  for  the  deviation,  although  the  error  is 
quite  small,  given  the  limitations  of  MININEC  in  modeling  antennas  near  real  ground. 

Summary 

Three  antennas  using  the  coupled-resonator  antenna  design  technique  have  been  described. 
The  two  three-frequency  dipole  examples  offer  a  comparison  of  bandwidths  obtained  using  small 
and  large  diameter  conductors.  The  five-frequency  vertical  dipole  antenna  illustrates  how  this 
design  method  allows  antennas  to  be  built  with  arbitrary  impedances  at  each  frequency  of  oper¬ 
ation.  Below  is  a  summary  of  the  most  significant  characteristics  of  antennas  designed  using 
the  coupled-resonator  principle: 

•  Multiple  frequency  operation  with  low  VSWR  at  each  frequency 

•  No  reactive  components 

•  Control  of  impedance  at  each  frequency 

•  No  matching  network  required 

•  Independent  fine  tuning  at  each  frequency  (very  little  interaction) 

•  Some  mechanical  complexity 

•  Narrower  VSWR  bandwidth  than  a  simple  dipole  or  monopole 

Although  several  methods  are  available  for  the  construction  of  multi-frequency  antennas, 
the  coupled-resonator  technique  has  flexibility  as  its  principal  advantage.  Antennas  built  using 
this  method  are  electrically  simple,  requiring  no  reactive  components  such  as  traps,  stubs  or 
matching  networks.  In  cases  where  these  characteristics  are  judged  important,  the  technique’s 
mechanical  requirements  may  be  deemed  acceptable. 

Further  work  is  planned  to  develop  design  equations  that  encompass  a  wider  range  of 
impedances  and  antenna  dimensions,  as  well  as  including  effects  of  the  number  of  resonators  in 
the  system. 
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Brief  Notes  on  the  Construction  of  Practical  Coupled-Resonator  Antennas 

The  electrical  simplicity  of  coupled-resonator  antennas  comes  at  the  expense  of  mechanical 
complexity.  Multiple  conductors  must  be  supported  in  a  manner  that  maintains  a  consistent 
spacing  distance. 

Flexible  wire  conductors  require  either  a  large  number  of  spacers,  or  they  can  be  kept  under 
tension.  For  end-supported  dipole  configurations,  an  equalizer  assembly  can  be  fabricated  that 
will  maintain  tension  on  all  wires,  as  shown  in  Figure  A  below.  Swivel  connections  or  bearings 
at  the  ends  are  also  needed  to  avoid  twisting.  In  general,  wire  antennas  become  extremely  diffi¬ 
cult  to  manage  beyond  three  conductors. 

Rigid  conductors,  such  as  aluminum  tubing,  are  much  easier  to  use.  If  supported  at  the  cen¬ 
ter,  typically  two  additional  spacers  are  sufficient  to  prevent  excessive  motion  in  the  wind.  They 
should  be  placed  about  2/3  of  the  distance  from  the  center  to  the  end  of  the  resonator  element. 

Common  insulating  materials  with  sufficient  strength  include  fiberglass  and  ABS  plastic. 
Brittle  materials  like  acrylic,  or  materials  with  insufficient  insulating  properties  (e.g.,  wood), 
should  be  avoided.  If  the  antenna  is  to  be  permanently  installed  outdoors,  weatherproofing  and 
ultraviolet  protection  should  either  be  inherent  in  the  material  used,  or  accomplished  with 
paints  or  other  protective  coatings. 


Figure  A  Tension  can  be  equalized  among  wire  conductors  using  three-point  spacer  I  insulators. 
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THE  OPTIMIZED  WIDEBAND  ANTENNA  (OWA)  AND  ITS  APPLICATIONS 


J.  K.  Breakall  WA3FET 
Electrical  Engineering  Department 
The  Pennsylvania  State  University 
University  Park,  PA  16802 


Abstract 

This  paper  introduces  a  new  and  unique  optimized  wideband  antenna  (OWA)  which  has  very  wide  bandwidth 
derived  from  optimization  of  the  geometrical  configuration  of  the  structure.  The  resulting  antenna  looks  similar  to  the 
omventional  open-sleeve  dipole  or  monopole,  however,  it  is  different  in  that  modem  computer  optimization  methods  and 
nignrirtinK  were  used  to  achieve  the  best  ^obally  chosen  set  of  length  and  spacing  parameters  to  achieve  a  minimization 
(^tbe  peak  VSWR  in  a  selected  frequency  range.  This  paper  will  show  resulting  designs  which  are  far  superior  to  past 
configurations  of  this  structure,  and  in  some  cases,  better  than  any  possible  known  antenna  configurations. 

Introduction 

There  have  been  many  articles  and  papers  over  the  years  describing  different  types  of  antennas  and 
other  techniques  applied  to  conventional  antenna  structures  to  achieve  wide  bandwidth.  There  is  a 
universal  rule  of  nature  called  "The  Law  of  Conservation  of  Difficulty"  which  states  that  if  one  tries  to  push 
certain  intrinsic  qualities  e.g.  antenna  gain,  bandwidth,  smaller  size,  etc.,  another  desirable  parameter  or 
characteristic  will  have  to  give  (after  Prof  R.  Adler,  K3CXZ,  Naval  Postgraduate  School,  Monterey,  CA). 
An  example  is  that  as  an  antenna  is  made  smaller,  a  possible  desirable  goal,  then  the  bandwidth  and  gain 
may  sacrifice.  To  achieve  wider  bandwidth,  then  the  size  of  the  antenna  may  have  to  be  increased  in  certain 
of  its  dimensions,  i.e..  cage  dipole,  conical  monopole,  discone,  etc.  Also,  bandwidth  can  be  increased  at 
the  expense  of  efficiency  and  gain,  as  in  a  resistively  loaded  antenna  or  an  antenna  close  to  the  ground  with 
very  highly  coupled  earth  losses.  In  the  extreme,  a  dummy  load  can  be  considered  a  very  wideband  but 
certainly  an  extremely  inefficient  low-gain  antenna. 

Bandwidth  as  considered  here  will  be  defined  as  the  total  frequency  span  between  some  specified 
SWR  values  at  each  end  of  the  range.  Traditionally,  the  2: 1  points  have  been  used  for  specifying  bandwidth 
of  antennas.  One  could  as  well  specify  bandwidth  between  the  1.5:1  SWR  points  or  any  other  values 
appropriate  to  the  application.  In  all  cases,  the  antenna  with  the  higher  bandwidth  will  be  that  which  gives 
a  larger  frequency  span  while  insuring  all  SWR  values  in  the  total  span  are  below  some  maximum  specified 
level. 

Some  Common  Broadband  Antennas 

It  is  well  known  that  classical  dipole  and  monopole  (vertical)antennas  can  be  made  broadband  by 
simply  increasing  their  diameters.  This  becomes  quite  impractical  at  the  lower  HF  ham  bands  (especially 
160,  80,  and  40  meters)  where  antennas  with  1  foot  or  greater  diameter  tubing  would  certainly  be 
mechanical  nightmares.  Fortunately,  several  thin  wres  in  the  form  of  a  cage,  a  cone,  or  planar  shapes  (fan 
dipole)  can  actually  be  equivalent  to  solid  conductors  of  large  diameter.  They  can  have  much  larger 
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bandwidths  than  just  a  single  wire  dipole  with  the  same  diameter  as  those  of  the  individual  wires  making 
up  the  structure.  These  types  of  antennas  obtain  bandwidth  increases  due  to  their  inherent  geometrical 
shape  and  size. 

The  traditional  open-sleeve  antenna  has  been  known  and  used  since  its  invention  in  1946  by  Dr.  J. 
T.  Bolljahn  of  the  Stanford  Research  Institute,  which  now  is  the  present  SRI,  International*.  The  basic 
form  of  this  antenna  consists  of  a  base-fed  central  monopole  with  two  parallel  closely  spaced  and  coupled 
parasitic  elements,  one  on  each  side  of  the  central  monopole,  and  shorted  to  a  ground  plane  which  the 
central  monopole  is  fed  against.  The  length  of  the  central  monopole  is  approximately  1/4  wavelength  and 
that  of  the  parasitic  elements  is  about  half  of  the  central  monopole  length. 

All  of  the  previous  and  present  work  which  can  be  found  in  the  literature  on  this  antenna  structure 
has  been  done  by  choosing  the  various  geometrical  lengths  and  spacings  empirically  by  a  trial  and  error  or 
cut-and-try  methods  for  obtaining  the  desired  characteristics.  The  work  to  be  described  in  this  paper  has 
used  modem  computer  optimization  methods  and  algorithms  to  achieve  the  best  globally  chosen  set  of 
length  and  spacing  parameters  to  achieve  a  minimization  of  the  peak  SWR  in  a  selected  band.  The  various 
antenna  wires  or  cylindrical  tubing  are  modeled  usmg  a  very  sophisticated  computer  optimization  progr^, 
ISIEC-OPT^  which  is  based  internally  on  NEC.  As  will  be  seen,  the  results  have  produced  designs  which 
are  far  superior  to  past  configurations  of  this  structure  and  in  some  cases  better  than  any  possible  Imown 
antenna  configurations.  Furthermore,  some  of  the  optimized  designs  reported  here  are  used  in  achieving 
extremely  low  SWR  Oess  thanl .  1 : 1)  over  bandwidths  that  have  never  been  tried  to  the  authors'  knowledge 
before  with  this  open-sleeve  concept. 

Results 


Several  optimizations  have  been  performed  for  some  desired  characteristics  over  some  selected 
configurations  and  bands  of  frequencies.  The  first  is  to  produce  an  antenna  that  is  much  wider  in  SWR 
bandwidth  than  the  classical  dipole  antenna  for  a  chosen  frequency  range  of  6.7  to  7.6  MHz.  A  dipole  was 
modeled  over  typical  ground  conditions,  e,  =  13.  and  a  =  .005  Siemens/meter,  and  placed  at  a  high  of  35 
feet  above  the  ground.  The  wire  gauge  used  was  #14.  The  variables  of  the  optimization  were  the  length 
of  the  central  center-fed  dipole,  the  lengths  of  the  two  additional  parasitic  elements  (both  equal  length  for 
this  optimization),  and  the  spacing  between  each  wire  and  the  central  dipole  wire.  The  SWR  versus 
frequency  is  shown  in  Figure  1  and  can  be  seen  to  be  under  1.5  to  1  over  the  complete  range.  In  fact,  it 
is  about  1.2  to  1  fi-om  7.0  to  7.4  MHz,  or  a  total  span  of  some  400  KHz.  One  can  also  see  the  classical 
dipole  SWR  over  the  same  band  in  Figure  1  which  is  under  1.2  to  1  only  over  a  50  KHz  range. 

The  80  meter  band  is  one  where  the  bandvridth  of  the  OWA  should  be  very  useful.  A  globally 
constrained  optimization  was  performed  from  3.5  to  4.0  MHz  with  three  variables  of  length  of  the  central 
wire,  lengths  of  the  two  equal  parasitic  wires,  and  equal  spacing  between  the  parasitic  and  central  wires. 
The  diameter  was  wire  gauge  #14  which  was  held  constant  and  the  antenna  was  placed  at  a  height  of  50 
feet  over  the  same  average  ground  conditions  as  above.  The  NEC-OPT  results  are  shown  in  Figure  2  along 
with  the  results  for  a  conventional  dipole  at  the  same  50  feet  with  a  ZO  =  50  ohm  reference.  The  SWR  for 
the  OWA  is  under  1.5  over  the  complete  80  meter  band  with  fully  efficient  operation. 
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Results  are  shown  for  just  a  simple  two  wire  configuration  which  has  been  optimized  for  the  7.0 
to  7.3  MHz  band  of  fi’equendes.  This  configuration  was  obtained  by  letting  the  optimizer  choose  the  two 
lengths  of  the  wires,  the  spacing  between  the  wires,  and  the  radius  of  the  additional  parasitic  wire  vdth  the 
driven  wire  diameter  fixed.  There  were  4  variables  which  the  optimization  procedure  was  run  with.  The 
SWR  versus  fi’equency  results  are  shown  in  Figure  3  and  is  under  1. 1  to  1  over  the  complete  band  which 
is  a  remarkable  characteristic  not  achieved  by  any  other  known  antenna  vnth  such  a  simple  geometrical 
construction. 

3  Element  40  Meter  Yagi  OWA  Driven  Element 

One  of  my  colleagues,  Tim  Duffy,  K3LR,  is  fortunate  to  have  a  190  foot  tower  and  two  stacked 
3  element  40  meter  yagis  at  heights  of  100  and  190  feet.  These  were  custom  designs  fi-om  LTA,  Inc^ 
Duflfy  explained  that  the  bandvwdth  of  these  ya^s  is  pretty  sharp  being  lucky  to  achieve  200  kHz  bandwidth 
between  the  2: 1  SWR  points. 

The  first  step  to  produce  a  wideband  yagi  was  to  achieve  a  desired  Front-to-Sidelobe  level  at  the 
proper  fi'equency  range  by  adjusting  the  spacings  and  lengths  of  the  parasitic  elements  of  a  conventional 
3  element  yagi.  This  was  done  with  all  the  tools  available,  NEC-OPT,  YO,  and  AO*.  When  the  design  was 
finalized  for  both  electrical  and  mechanical  properties,  the  next  step  was  to  incorporate  the  OWA  into  the 
yagi  as  an  effective  driven  element.  NEC-OPT  was  used  to  find  the  optimum  spacing  and  lengths  of  the 
driven  element  and  closely  spaced  parasitic  element  that  form  the  OWA.  The  yagi  really  has  4  elements 
in  this  case,  however,  only  the  director  and  reflector  at  the  ends  of  the  boom  affect  the  pattern.  The  extra 
parasitic  element  very  close  to  the  driven  element  only  is  used  to  match  the  impedance  and  widen  the 
bandwidth  as  in  the  normal  OWA. 

The  amazing  tlnng  about  using  the  OWA  in  a  yagi  is  that  a  solution  can  be  found  not  only  for 
wideband  SWR  results,  but  the  impedance  match  can  be  selected  for  50  ohms  directly  with  no  form  of 
impedance  transformation  commonly  used  on  yagis,  such  as  gamma,  beta,  hairpin,  etc.  matches.  All  that 
has  to  be  done  is  to  use  a  direct  connecting  1 :1  current  type  or  equivalent  balun  and  the  antenna  is  matched. 
This  is  really  a  remarkable  finding  and  should  make  the  other  forms  of  matching  obsolete.  SWR  response 
is  shown  in  Figure  4  and  one  can  clearly  observe  the  extremely  wide  and  unheard  of  bandwidth  that  this 
feed  has.  This  is  at  the  antenna  feedpoint  and  even  better  response  will  be  found  at  the  end  of  the  coax 
feedline  at  the  transmitter.  The  Front-to-Sidelobe  level  of  this  design  is  greater  than  15  dB  across  the 
complete  40  meter  band  and  peaks  at  37  dB  at  7.05  MHz. 

Other  yagi  designs  have  been  optimized  for  15  and  10  meters  for  both  Front-to-Sidelobe  levels  and 
SWR  bandwidth.  Results  of  these  designs  are  shown  in  Figure  5  and  6. 

CONCLUSIONS 

As  mentioned  above,  there  are  still  many  more  combinations  which  can  be  tried  and  tested.  It  only 
takes  setting  up  the  cases  and  performing  the  long  computer  runs. 

Construction  and  application  of  the  OWA  is  probably  best  suited  and  visualized  in  the  high 
jfrequency  (HF)  and  higher  frequencies.  The  antenna  can  actually  be  made  of  wire  or  tubing  with  just  a  very 


35 


simple  feeding  structure.  A  matching  transformer  or  other  method  may  be  required  to  provide  the  correct 
and  best  matching  impedance  to  the  feedpoint.  This  antenna  should  have  application  well  up  into  the  GHz 
regions. 
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Figure  3.  40  Meter  OWA  Dipole,  Simple  2  Wire  Configuration 
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Figure  4.  4  Element  40  Meter  Yagi,  OWA  Drive  Element 
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Figure  5.  6  Element  15  Meter  Yagi,  OWA  Driven  Element 


The  Quad-Rhomb  Antenna 

A  New  AH-Band  Antenna  for  Amateur  Radio  Applications 


Rudiger  Anders,  AA2HT 
Applied  Electromagentics  Engineering 
32  Parkgate  Drive 
Atlanta,  GA  30328 


Introduction 

Covering  all  HE  bands  between  160m  and  10m  with  only  one  antenna  is  one  of  the  objectives  of  new¬ 
comers  to  Amateur  Radio  and  old-timers  as  well.  Although  not  easily  attainable  with  a  rotatable  YAGI- 
UDA  beam  this  objective  can  be  achieved  for  a  fixed  direction,  for  instance  working  Europe  from  the 
US,  while  providing  omnidirectional  coverage  for  domestic  and  NVIS  [1]  operations  on  the  low  bands. 

The  73m  rectangular  loop  antenna 

The  Quad-Rhomb  is  derived  from  the  horizontally  supported  85.1m  W9LZX  Lazy  Quad  [2],  which 
operates  as  a  full  wave-  or  quad  loop  on  the  80/75m  band,  with  similar  behavior  on  160m  and  40m.  If 
fed  in  one  corner  the  Lazy  Quad  behaves  like  a  small,  non-term inated  rhombus  antenna  on  the  upper  HE 
bands,  with  the  main  lobe  oriented  along  the  diagonal  through  the  feed  point  corner.  Dimensions  of  the 
Lazy  Quad  are  usually  chosen  such  that  it  resonates  on  all  of  the  harmonically  related  160m  through  10m 
bands.  This  common  practice  however  trades  ease  of  matching  for  desirable  low  sidelobe  levels  and  par¬ 
ticularly  compromises  the  near  side  lobe  level  and  gain  on  10m,  as  shown  in  Eig.  1. 


Fig.  1  Horizontal  farfield  (elevation  12“)  of  the  85.1m  Lazy  Loop,  at  15m  height  over  average  ground 
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The  overall  sidelobe  behavior  can  be  improved  considerably  if  the  resonance  requirement  is  dropped  and 
the  loop  circumference  reduced  to  73.0m/239.5ft  from  85.1m/279.2ft.  Also,  the  half  power  beam  width 
increases  favorably  on  the  upper  HF  bands  and  the  10m  gain  figure  reaches  that  of  the  12m  band.  The 
reduction  of  the  loop  side  length  by  3,1m  to  18.25m,  saves  about  25%  lot  space,  a  welcome  bonus  for 
the  space  limited  operator. 

The  modified  73m  rectangular  loop  or  Quad-Rhomb  antenna  (QR  60/30) 

An  even  more  dramatic  improvement  of  the  73m  loop  can  be  achieved  through  controlling  the  current 
distribution  along  the  loop  the  with  a  closed  stub  (Zi=450Ohm)  that  is  put  in-line  with  the  loop  at  the 
corner  opposite  to  the  feed  point,  as  depicted  in  Fig.  4.  With  a  stub  length  of  8.9m/29.2ft  (v=l,0)  opti¬ 
mized  [3]  for  minimum  sidelobe  levels  on  the  DX  bands  the  Quad-Rhomb  features  the  desired  horizontal 
radiation  patterns  shown  in  Fig.  2  with  the  vertical  patterns  for  the  QR  60/30  (Quad-Rhomb,  approx.  60ft 
side  length  and  approx.  30ft  stub  length)  at  15.25m  (50.0ft)  given  in  Fig  3. 


Fig.  2  Horizontal  farfield  (elevation  12’)  of  the  QR  60/30,  at  15m  height  over  average  ground 


Fig.  3  Vertical  farfield  (azimuth  0°)  of  the  QR  60/30,  at  15m  height  over  average  ground 
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Lightning  protection 

Protection  against  common  mode  lightning  discharge  can  easily  be  established  by  ground^  die  center 
of  the  stub  end.  Differential  mode  protection  requires  a  gas  discharge  device  across  the  feed  point  comer. 


Fig.  4  Perspective  view  of  the  geometry  of  the  QR  60/30 
System  Resonance  and  Matching 

The  Quad-Rhomb,  if  fed  by  a  balanced  transmission  line  of  33.85m/lllft  (v=1.0)  will  resonate  on  all 
bands,  Fig.  5,  and  with  the  exception  of  160m  features  an  input  resistance  around  lOOOhms.  A  2:1  hybrid 
balun  at  die  station  end  of  the  33.85m/lllft  transmission  line  may  be  used  to  down-match  to  50Ohms. 
For  other  line  lengths  an  ATU  is  recommend. 


Fig.  5  Perspective  view  of  the  geometry  of  the  QR  60/30  including  33.85m  balanced  feed  line 
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Fig.  6  Input  impedance  of  the  QR  60/30,  at  15m  height  over  average  ground 

For  a  better  grasp  of  the  farfield  characteristic  Fig.  7,  Fig.  8,  and  Fig.  9  provide  perspective  views  of  the 
3-D  pattern  at  two  charcteristic  frequencies. 


Fig.  7  3-D  farfield  of  the  QR  60/30,  at  15m  height  over  average  ground,  f=3.8  MHz 
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Fig.  8  3-D  farfield  of  the  QR  60/30,  at  15m  height  over  average  ground,  f=14.15  MHz 


Fig.  9  3-D  farfield  of  the  QR  60/30,  at  15m  height  over  average  ground,  f=24.95  MHz 
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Conclusion 

A  novel  all-band  antenna  has  been  presented  that  features  quasi-omnidirectional  coverage  on  the 
low  ham  bands  (160m,  80m,  and  40m)  and  highly  directive,  low  sidelobe,  bi-directional  radiation 
patterns  on  the  upper  HF  DX  bands  (30m  to  10m).  Due  to  the  closed  loop  design  which  is  known 
for  an  insensitivity  to  rain  static  and  corona  effects  the  Quad-Rhomb  antenna  presents  itself  as  very 
quiet  listening  antenna,  much  appreciated  among  DX  operators. 
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Introduction 

Quite  a  number  of  software  packages  are  currently  available  to  help  the  DX  enthusiast  with  forecasting 
or  predicting  HE  skywave  propagation.  Why  then  another  program?  Unfortunately  the  majority  of  the 
software  caters  to  the  needs  of  the  expert  user  who  already  knows  the  ins  and  outs  of  HE  propagation, 
leaving  the  newcomer  out  in  the  cold.  While  these  products  provide  reasonable  results  for  the  MUE,  the 
traffic  probability,  and  in  some  cases  the  signal  levels  to  be  expected,  they  offer  very  few  options  to  sys¬ 
tematically  explore  the  response  to  the  various  input  parameters  in  order  to  gain  insight  into  the  short- 
and  long-term  behavior  of  skywave  propagation.  It  is  particularly  desirable  for  newcomers  and  the  occa¬ 
sional  DXer  to  improve  the  basic  understanding  of  skywave  propagation  through  interactively  exploring 
the  response  of  a  computer  propagation  model  in  a  much  shorter  time  period  than  years  of  observation 
can  do. 


Objectives 

SKY-WAVES-95  has  been  designed  with  the  emphasis  on  self  study  by  providing  quick  answers  to  com¬ 
mon  questions  like: 

How  does  the  MUE  fluctuate  annually  for  the  same  circuit? 

When  does  E-layer  blanketing  render  the  E2-layer  ineffective? 

Does  the  operating  fi-equency  have  an  influence  on  the  MUE? 

Why  is  there  only  one  hop  mode  at  band  opening  or  closing? 

How  does  the  antenna  height  affect  the  length  of  a  band  opening? 

while  at  the  same  time  addressing  the  needs  of  the  pros.  SKY-WAVES-95  is  constantly  being  checked 
and  evaluated  against  daily  propagations  into  various  parts  of  the  world. 


Interactive  variables 


terminal  coordinates 
local  time  (or  UTC) 
solar  flux 

probability  mode(EOT,  MUE,  HPF) 
bearing  (range  mode) 
path  (short,  long) 


operating  frequency 
date 
K  index 

height  of  antenna,  stack  height 
grayline  (morning,  evening) 
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Basic  Model 


SKY'WAVES-95  is  based  in  part  on  the  critical  frequency  fOf2  model  of  the  improved  MINIMUF-85 
[1],  in  conjunction  with  the  LOCKWOOD  approximations  for  the  m(d)F2  and  m(d)E  range  factors  [2], 
which  account  for  an  underlying  ionization  of  the  E-layer  and  the  LOCKWOOD  approximation  for  the 
mirroring  height[3].  Ionospheric  path  incorpotates  the  BARGHAUSEN  D-layer  attenuation[4],  and  the 
DAVIS  mode  attenuation[5].  Rather  than  through  control  points  SKY-WA^S-95  utilizes  the  actual 
location  of  the  ionospheric  refraction  areas  and  Fresnel  ground  reflection  regions  for  path  evaluation.  In 
combination  with  a  novel  iteration  technique  it  becomes  possible  to  account  for  the  diurnal  and  geo¬ 
graphical  variations  of  the  F2-layer  mirroring  height  along  the  propagation  path.  As  a  results  the  takeoff 
the  angles  and  the  band  opening  and  closing  schedules  become  more  accurate  and  the  predicted  relative 
path  attenuation  more  realistic. 

Currently  SKY-WAVES-95  presents  the  results  in  tabular  form  Fig.l,  and  in  terminal  graphics  Fig.2.  In 
order  to  better  and  systematically  understand  the  sun  related  diurnal  and  seasonal  nature  of  the  physics 
of  propagation  the  results  are  represented  in  terms  of  the  local  mean  time  at  the  circuit  terminals  rather 
than  UTC. 


Examples  and  Output  Samples 


TX  qth: 
RX  qths 


JAN  8  1996,  LT1= 
TX  boom  height= 
RX  boom  height= 


RX  take-on  angle= 


00  MHz 
10.06“ 
11.53“ 


ltl= 

0.00 

utc=  5.00 

lt2=  6.00 

ltl= 

1.00 

utc=  6.00 

lt2=  7.00 

ltl= 

2.00 

utc=  7.00 

lt2=  8.00 

ltl= 

3.00 

utc=  8.00 

lt2=  9.00 

ltl= 

4.00 

utc=  9.00 

lt2=10.00 

ltl= 

5.00 

utc=10.00 

lt2=11.00 

ltl= 

6.00 

utc=ll . 00 

lt2=12.00 

-  SKY-WAVES-95  - 

Atlanta  33,45“n  84.23°W  antenna  bearing  ->  40.24“/N 

Dassendorf  53.32°N  10.22*E  ground  distance  ->  7345.18  Km 

-  (C)  AA2HT  1995  - 

•varies,  UTC=varies,  sfx=111.00,  ssn=  55,  K=  0,  freq=  14. 

30.00  m,  TX  stack  height=  7.90  m,  TX  take-off  angles 
26.00  m,  RX  stack  height=  0.00  m,  ' 

short  path  (MUF)  — 

h=2  MUF=10.19  MHz  elv=  2.04'  att=  81.85  dB 

h=2  MUF=10.18  MHz  elv=  1.46“  att=  88.34  dB 

h=2  MUF=10.20  MHz  elv=  1.25“  att=  94.78  dB 

h=2  MUF=10.39  MHz  elv=  1.25°  att=  60.73  dB 

h=2  MUF=10,28  MHz  elv=  1.01“  att=  68.35  dB 

h=3  MUF=  9.53  MHz  elv=  6.54°  att=  51.99  dB 

h=3  MUF=12.76  MHz  elv=  7.27°  att=  8.99  dB 

•wv - vv - vv - 


ltl=  7.00 
ltl=  8.00 
ltl=  9.00 
ltl=10.00 
ltl=11.00 
ltl=12.00 
ltl=13.00 


Utc=12.00 
utc=13.00 
utc=14.00 
Utc=15.00 
Utc=16.00 
Utc=17 .00 
Utc=18.00 


lt2=13.00 

lt2=14.00 

lt2=15.00 

lt2=16.00 

lt2=17.00 

lt2=18.00 

lt2=19.00 


h=4  MUF=16.91 
h=4  MUF=19.26 
h=4  MUF=19.32 
h=3  MUF=22.21 
h=4  MUF=17.74 
h=4  MUF=15.99 
h=3  MUF=16.11 


MHz  elv=12.41° 
MHz  elv=12.36° 
MHz  elv=12.33° 
MHz  elv=  7.10° 
MHz  elv=12.26“ 
MHz  elv=12.41° 
MHz  elv=  7.28° 


att=  2.52  dB 
att=  2.43  dB 
att=  5.88  dB 
att=  8.47  dB 
att=  9.53  dB 
att=  9.86  dB 
att=  8.68  dB 


ltl=14.00 
ltl=15.00 
ltl=16.00 
ltl=17.00 
ltl=18.00 
ltl=19.00 
ltl=20.00 
ltl=21.00 
ltl=22.00 
ltl=23.00 
ltl=  0.00 


Utc=19.00 
Utc=20 .00 
utc=21.00 
utc=22.00 
utc=23.00 
utc=  0.00 
utc=  1.00 
utc=  2.00 
utc=  3.00 
utc=  4.00 
utc=  5.00 


lt2=20.00 
lt2=21.00 
lt2=22.00 
lt2=23.00 
lt2=  0.00 
lt2=  1.00 
lt2=  2.00 
lt2=  3.00 
lt2=  4.00 
lt2=  5.00 
lt2=  6.00 


h=3 

MUF=12.36 

MHz 

elv= 

7.42“ 

att=  17.34 

dB 

h=3 

MUF=11.54 

MHz 

elv= 

8.02° 

att=  21.73 

dB 

h=3 

MUF=10.84 

MHz 

elv= 

8.28° 

att=  27.70 

dB 

h=2 

MUF=11.79 

MHz 

eiv= 

2.05* 

att=  35.95 

dB 

h=2 

MUF=11.26 

MHz 

elv= 

2.53° 

att=  36.65 

dB 

h=2 

MUF=10.90 

MHz 

elv= 

2.53° 

att=  42.25 

dB 

h=2 

MUF=10.63 

MHz 

elv= 

2.50° 

att=  50.65 

dB 

h=2 

MUF=10.43 

MHz 

elv= 

2.44° 

att=  59.25 

dB 

h=2 

MUF=10.30 

MHz 

elv= 

2.33° 

att=  67.59 

dB 

h=2 

MUF=10.22 

MHz 

elv= 

2.30° 

att=  73.73 

dB 

h=2 

MUF=10.19 

MHZ 

elv= 

2.04° 

att=  81.85 

dB 

Fig.  1  Tabular  form  of  24  hour  MUF  and  relative  attenuation  distribution 
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- SKY-WAVES- 95 - - 

TX  qth;  Atlanta  33.45°N  84.23°W  antenna  bearing  ->  40.24VN 

RX  qth:  Dassendorf  53.^2°N  10.22°E  ground  distance  ->  7345.18  Km 

-  (c)  AA2HT  1995  - 

JAN  8  1996,  LTl=varies,  UTC=varies,  sfx=111.00,  ssn=  55,  K=  0,  freq=  14.00  MHz 
TX  boom  height=  30.00  m,  TX  stack  height=  7.90  m,  TX  take-off  angle=  10.06° 

RX  boom  height=  26.00  m,  RX  stack  height=  0.00  m,  RX  take-on  angle=  11.53° 

-  short  path  (MUF)  - 

6h  8h  lOh  12h  14h  16h  18h  20h  22h  0  LT2  2h  4h  6h 

3  0  2  0 

MHz  I  deg 


10  222222 
0  LTl 


2 

°  22 

.0 

22222222222-°°° 
2h  4h  6h 


22222 

°°°°°222222 

°°°  2222222222 
— + — + — + — + — + — + — +-222222222 
14h  16h  18h  20h  22h 


Fig.  2  Terminal  graphic  of  24  hour  MUF  distribution  (winter) 
numerals  indicate  hop  numbers,  °  indicate  takeoff  angle 


TX  qth: 
RX  qth: 


JAN  8  19! 
TX  boom  he 
RX  boom  he 


6h 

0  + — +- 
dB  I 


60  +— +— 
0  LTl 


-  SKY-WAVES-95  - 

Atlanta  33.45°N  84.23°W  antenna  bearing  ->  40. 

Dassendorf  53.32°N  10.22°E  ground  distance  ->  7345. 

-  (c)  AA2HT  1995  - 

6,  LTl=varies,  UTC=varies,  sfx=111.00,  ssn=  55,  K=  0,  freg=  14.0 
ight=  30.00  m,  TX  stack  height=  7.90  m,  TX  take-off  angle=  1 
ight=  26,00  m,  RX  stack  height=  0.00  m,  RX  take-on  angle=  1 

-  short  path  (MUF)  - 

h  lOh  12h  14h  16h  18h  20h  22h  0  LT2  2h  4h 


3  °°°° °°  "44434444443333 
3  °  ..  coc  ,  33 

3  °  °  °  °  333 

23°  .  °.°  °  .333  .... 

2  °  .  .  333 

2  °  .  .  .......33 

“  °°°2°°  °  °°°  322 

“  °  °2°  °  222  22 
°  .  °  32°  .  .  ,  .  .°  222222  . 

"  °  3°  °  222 
°  °  3°  °  .«».»22°°°°°°° 

°  “333°  °°°”  22 

°°°°°3  °  222 
-22— +23+— +—+—+—+—+—+—+—+—+—+—+—+“+— +-+—22-+— 
ti  4h  6h  8h  lOh  12h  14h  16h  18h  20h  22h 
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-  SKy-WAVES-95  - 

TX  qth;  Atlanta  33,45'N  84.23°W  antenna  bearing  ->  40.24VN 

RX  qth:  Dassendorf  53.32°N  10.22°E  ground  distance  ->  7345.18  Km 

-  (C)  AA2HT  1995  - 

AUG  1  1996,  LTl=varies,  UTC=varies,  sfx=100.00,  ssn=  42,  K=  0,  freq=  14.00  MHz 
TX  boom  height=  30.00  m,  TX  stack  height^  7.90  m,  TX  take-off  angle=  10.06° 

RX  boom  height®  26.00  m,  RX  stack  height®  0.00  m,  RX  take-on  angle®  11.53° 

— - short  path  (MUF)  - - 

6h  8h  lOh  12h  14h  16h  18h  20h  22h  0  LT2  2h  4h  6h 


0  +- 
dB 


2222 

20  +  22 
222 


222  22°' 
2222° 


_+ — + — + — + — + — + — + — + — + — + — + — + — + — + — + — + — + — + — + — + — +  15 

deg 

22222  2222 

22  »»»»"•'>«'>"<><■•»•  22222  222222  22  2222222222 
2222222222222222222  22222  222222  22 
.222  .°  .  .  2222  °  .  .  .  .  .  +10 
22 


0  LTl  2h  4h  6h  8h  lOh  12h  14h  16h  18h  20h  22h  24h 


Fig.  4  Terminai  graphic  of  24  hour  MUF  distribution  (summer) 
numerals  indicate  hop  numbers,  °  indicate  takeoff  angle 


TX  qth: 
RX  qth; 


antenna  bearing  -> 
ground  distance  -> 


40.24°/N 
7345.18  Km 


-  SKY-WAVES-95  - 

Atlanta  33.45°N  84.23°W 

Dassendorf  53.32°N  10.22°E 

-  (c)  AA2HT  1995  • 

AUG  1  1996,  LTl=varies,  UTC=varies,  sfx=100.00,  ssn=  42,  K=  0,  freq=  14.00  MHz 
TX  boom  height®  30.00  m,  TX  stack  height®  7.90  m,  TX  take-off  angle®  10.06° 

RX  boom  height®  26.00  m,  RX  stack  height®  0.00  m,  RX  take-on  angle®  11.53° 

-  short  path  {MUF)  - 

6h  8h  lOh  12h  14h  16h  18h  20h  22h  0  LT2  2h  4h  6h 

25  +—+—+—+—+—+—+—+—+—+—+—+—+—+—+—+—+—+—+—+—+—+—+—+—+  15 
MHz  I  [deg 


15  +  .  °° 

>°  “o 

2222 

I  2222222 


°°  22222222222 
°°  2222222  2 
2222  22 

.  .22  .  .  .  22222222222222°  .  2222 

- 22 - °  °2222°°°  °°2222-°2222< 

222  2222222  °  22 

22222  I 


10  + — + — +— 
0  LTl  2h 


lOh  12h  14h  16h  18h  20h  22h 


Fig.  5  Terminal  graphic  of  24  hour  MUF  distribution  (summer) 
numerals  indicate  hop  numbers,  °  indicate  takeoff  angle 
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-  SKy-WAVES-95  - 

TX  qth:  Atlanta  33.45'’H  84.23'W 

RX  qth:  Dassendorf  53.32''N  10.22''E 

-  (c)  AA2HT  1995  — 

JAN  8  1996,  LT1=  8.00,  UTC=  13.00,  sfx=111.00, 
TX  boom  height=  30.00  m,  TX  stack  height=  7.90 
RX  boom  height=  26.00  m,  RX  stack  height=  0.00 

short  path  (MUF) 


antenna  bearing  ->  40.24°/N 

ground  distance  ->  7345.18  Km 

ssn=  55,  K=  0,  freq=  14.00  MHz 
TX  take-off  angle=  10.06° 
RX  take-on  angle==  11.53° 


-> 

-> 

-> 

-> 

-> 

-> 

-> 


hop{s) , 
hop{s) , 
hop(s), 
hop ( s ) , 
hop{s) , 
hop { s ) , 
8  hop ( s ) , 


MUF  ->  25.34  MHz, 
MUF  ->  22.47  MHz, 
MUF  ->  19.26  MHz, 
MUF  ->  16.52  MHz, 
MUF  ->  14.54  MHZ, 
MUF  ->  13.16  MHz, 
MUF  ->  12.08  MHz, 


elevation  ->  0 
elevation  ->  7 
elevation  ->  12 
elevation  ->  17 
elevation  ->  22 
elevation  ->  27 
elevation  ->  31 


-  F2 -layer  PENETRATION  for  elevations  >36 


,16°,  signal 
.10°,  signal 
.36°,  signal 
,27°,  signal 
,13°,  signal 
.25°,  signal 
.41°,  signal 
,01°,  signals 


loss  ->  57.18  dB 
loss  ->  4.24  dB 
loss  ->  2.43  dB 
loss  ->  12.07  dB 
loss  ->  30.28  dB 
loss  ->  28.94  dB 
loss  ->  82.13  dB 
lost  in  space  — 


Fig.  4  Hop  statistic  for  14MHz  operating  frequency 


-  SKY-WAVES-95  - 

TX  qth:  Atlanta  33.45°N  84.23°W  antenna  bearing  ->  40.24°/N 

RX  qth:  Dassendorf  53.32°N  10.22°E  ground  distance  ->  7345.18  Km 

-  (C)  AA2HT  1995  - 

JAN  8  1996,  LT1=  8.00,  UTC=  13.00,  sfx=111.00,  ssn=  55,  K=  0,  freq=  25.00  MHz 
TX  boom  height=  30.00  m,  TX  stack  height=  7.90  m,  TX  take-off  angle=  5, 38° 

RX  boom  height=  26.00  m,  RX  stack  height=  0.00  m,  RX  take-on  angle=  6.37° 

-  short  path  (MUF)  - 

->  2  hop(s),  MUF  ->  25.48  MHz,  elevation  ->  2.27°,  signal  loss  ->  9.49  dB 

->  3  hop(s),  MUF  ->  22.45  MHz,  elevation  ->  9.13°,  signal  loss  ->  25.68  dB 

-  F2-iayer  PENETRATION  for  elevations  >  13.59°,  signals  lost  in  space  — 


Fig.  5  Hop  statistic  for  25Mhz  operating  frequency 


-  SKY-WAVES-95  - 

TX  qth:  Atlanta  33.45°N  84.23°W  antenna  bearing  ->  40.24°/N 

RX  qth;  Dassendorf  53.32°N  10.22°E  ground  distance  ->  7345.18  Km 

-  (c)  AA2HT  1995  - 

JAN  8  1996,  LT1=  8.00,  UTC=  13.00,  sfx=111.00,  ssn=  55,  K=  0,  freq=  28.00  MHz 
TX  boom  height^  30.00  m,  TX  stack  height=  7.90  m,  TX  take-off  angle=  5.02° 

RX  boom  height=  26.00  m,  RX  stack  height=  0.00  m,  RX  take-on  angle=  5.55° 

-  short  path  (MUF)  - 

->  2  hop(s),  MUF  ->  25.34  MHz,  elevation  ->  3.03°,  signal  loss  ->  15.94  dB 

»»>»»»»»»»»»»  antenna  height  less  than  optimal  <<<<<<<<<<<<<<<<<<<< 
-  F2-layer  PENETRATION  for  elevations  >  9.13°,  signals  lost  in  space  — 


Fig.  6  Hop  statistic  for  28MHz  operating  frequency 
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-  SKY -WAVES -95  - 

TX  qth:  Atlanta  33.45°N  84.23°W  antenna  bearing  ->  40.24 °/N 

RX  qth:  Dassendorf  53.32“N  10.22°E  ground  distance  ->  7345.18  Km 

-  (c)  AA2HT  1995  - 

JAN  8  1996,  LT1=  17.43,  UTC=  22.43,  sfx=100.00,  ssn=  42,  K=  0,  freq=  14.00  MHz 
TX  boom  height=  30.00  m,  TX  stack  height=  7.90  m,  TX  take-off  angle=  5.02° 

RX  boom  height=  26.00  m,  RX  stack  height=  0.00  m,  RX  take-on  angle=  5.55° 

— -  grayline  bearings  ->  334.04°  and  154.04°  - 

hp=l  range=  1000.0  Km  trav=  1263.7  Km  MUF=  11.35  MHz  elv=33.16°  att=  22.64  dB 

hp=l  range=  2000.0  Km  trav=  2148.4  Km  MUF=  17.67  MHz  elv=13.01°  att=  1.03  dB 

hp=l  range=  3000.0  Km  trav=  3128.4  Km  MUF=  21.48  MHz  elv=  4.41°  att=  8.24  dB 


hp=2 

range=  4000.0 

Km 

trav=  4257.5 

Km 

MUF= 

15.42 

MHz 

elv=11.47° 

att= 

0.43 

dB 

hp=2 

range=  5000,0 

Km 

trav=  5237.0 

Km 

MUF= 

16.11 

MHz 

elv=  7.07° 

att= 

2.94 

dB 

hp=2 

range=  6000,0 

Km 

trav=  6225.6 

Km 

MUF= 

17.35 

MHZ 

elv=  3.45° 

att= 

11.71 

dB 

hp=2 

range=  7000.0 

Km 

trav=  7225.0 

Km 

MUF= 

18.13 

MHz 

elv=  1.06' 

att= 

32.23 

dB 

hp=3 

range=  8000.0 

Km 

trav=  8329.0 

Km 

MUF= 

15.88 

MHz 

elv=  5,30° 

att= 

6.17 

dB 

hp=3 

range=  9000.0 

Km 

trav=  9328.1 

Km 

MUF= 

17.46 

MHz 

elv=  3.27° 

att= 

13.10 

dB 

hp=3 

range=10000 . 0 

Km 

trav=10326.2 

Km 

MUF= 

17.93 

MHz 

elv=  1.44° 

att= 

24.49 

dB 

hp=3 

range=11000 . 0 

Km 

trav=11342.0 

Km 

MUF= 

18.39 

MHz 

elv=  0.17° 

att= 

55.98 

dB 

hp=4 

range=12000 .0 

Km 

trav=12465.7 

Km 

MUF= 

17.35 

MHz 

elv=  3.44° 

att= 

11.95 

dB 

hp=4 

range=13000 .0 

Km 

trav=13441.2 

Km 

MUF= 

17.71 

MHz 

elv=  2.36° 

att= 

17.82 

dB 

hp=4 

range=14000 . 0 

Km 

trav=14515.5 

Km 

MUF= 

17.90 

MHz 

elv=  1.29° 

att= 

27.16 

dB 

hp=4 

range=15000 . 0 

Km 

trav=15503.1 

Km 

MUF= 

18.33 

MHz 

elv=  0.30° 

att= 

45.80 

dB 

hp=5 

range=16000 .0 

Km 

trav=16651 . 6 

Km 

MUF= 

17.51 

MHz 

elv=  3.17° 

att= 

14.03 

dB 

hp=5 

range=17000 . 0 

Km 

trav=17621.3 

Km 

MUF= 

16.02 

MHZ 

elv=  2,24° 

att= 

19.19 

dB 

hp=5 

range=18000 .0 

Km 

trav=18652.3 

Km 

MUF= 

15.01 

MHz 

elv=  1.30° 

att= 

26.99 

dB 

hp=5 

range=19000.0 

Km 

trav=19759.0 

Km 

MUF= 

14.58 

MHz 

elv=  1,06° 

att= 

32.44 

dB 

hp=6 

range=2 0000.0 

Km 

trav=2 0899.1 

Km 

M0F= 

14.19 

MHz 

elv=  3.17° 

att= 

14.03 

dB 

Fig.  7  grayline  statistic  (evening)  for  14MHz  operating  frequency 


SKY-WAVES-95  is  available  from  the  author  (free  of  charge) . 

References 

[1]  D.  B.  Sailors,  R.  A.  Sprague,  W.  H.  Rix,  "MINIMUF-85:  An  Improved  HF  MUF  Prediction  Algo¬ 
rithm",  Naval  Ocean  System  Center,  San  Diego,  TR  1121, 1986. 

[2]  M.  Lockwood,  "Simple  M-factor  algorithm  for  improved  estimation  of  the  basic  maximum  usable 
frequency  of  radio  waves  reflected  from  the  ionospheric  F-region",  IRE  Proc.,  Vol.  130,  Pt.  F,  No. 
4,  June  1983,  pp.  296-302. 

[3]  M.  Lockwood,  "Simplified  estimation  of  ray-path  mirroring  height  for  HF  radio  waves  reflected 
from  the  ionospheric  F-region",  IRE  Proc.,  Vol.  131,  Pt.  F,  No.  2,  April  1984, 117-124. 

[4]  A.  F.  Barkhausen  et  al.,  "Predicting  Long-Term  Operational  Parameters  of  High-Frequency  Sky- 
Wave  Telecommunication  Systems",  ESSA,  TR  ERL  110-ITS  78,  1969,  pp.  61-64. 

[5]  K.  Davies,  "Ionospheric  Radio",  Peter  Peregrinus  Ltd.,  1990,  p.  459. 


51 


USING  HAM  RADIO  CEM  CODES  TO  GAIN  INSIGHT  INTO 
VHF  GROUND  PLANE  ANTENNA  PERFORMANCE 
AND  TO  MITIGATE  75  METER  MARS  RFI 
AT  A  NAVAL  RECEIVING  SITE 

Art  Hoffman,  WA6TVN 
R.  W.  Adler,  K3CXZ 
Naval  Postgraduate  School 

ANTENNA  DESIGN 

The  authors  have  been  users  of  the  original  MININEC  in  solving  for  VHF/UHF  antenna 
performance  for  over  15  years.  Serious  use  began  with  the  release  of  the  MININEC  SYSTEM  for  the 
design  of  two  and  three-element  yagis  with  direct  coax  feed  for  the  10  and  2-meter  bands.  The  boom  was 
used  as  a  balun  and  was  included  in  the  model  since  it  extended  beyond  the  back  end  of  the  array. 
Designs  were  constructed  from  non-magnetic  stainless  steel  brewery  tubing  or  from  copper  water  pipe. 
SWR  measurements  closely  tracked  the  MoM  results.  Operational  beamwidth  was  as  predicted  but  the 
front-to-back  ratio  was  dependent  on  nearby  local  obstructions,  as  expected.  During  this  time,  MN  and 
ELNEC  became  available  and  were  pressed  into  service. 

In  1984,  2-meter  activity  flourished  locally  and  new  hams  were  in  need  of  simple  home-brew 
antennas  for  home  use  with  their  handie-talkies.  The  venerable  standard  quarter-wave  (QW)  4-radial 
ground  plane  5-wire  design  (Figure  1)  looked  like  something  simple  to  model  and  it  could  be  built  from 
inexpensive  readily  available  materials.  MININEC  models  correctly  identified  element  lengths  and  input 
impedance,  but  radiated  performance  was  not  as  satisfying  as  expected.  It  was  suggested  by  CDR  Buzz 
Shaw,  WAINHP,  that  a  set  of  careful  measurements  should  be  made  for  comparison  to  a  known  basic 
antenna  such  as  a  vertical  half-wave  dipole  reference  antenna.  (Buzz  had  designed  a  version  of  a  J-pole 
antenna,  for  his  MS  thesis  project  in  1974,  before  MININEC  was  available,  that  featured  a  direct  feed 
from  a  coax  cable.)  The  comparison  of  the  dipole  to  a  4-radial  ground  plane  (GP)  revealed  a  serious 
shortfall  in  gain  (as  much  as  -3  dB)  of  the  GP.  A  second  comparison  was  made  to  a  coaxial  half-wave 
vertical  dipole.  The  coax  dipole  seemed  to  suffer  the  same  degradation  as  the  GP,  showing  about  a  2  dB 
difference.  The  possibility  of  a  common  upset  was  looming. 

Ten  years  earlier  we  noticed  the  same  disappointing  performance  for  a  carefully  machined 
alumimun  GP  with  12  radials,  built  in  our  machine  shop.  Various  mounting  schemes  did  not  improve 
performance,  but  some  very  expensive  RF  absorbing  material  wrapped  around  the  support  mast  below  the 
feed  point  and  down  for  a  half  wavelength  did  improve  the  observed  gain.  This,  once  remembered,  lead 
to  speculation  that  the  mast  under  the  radials  somehow  had  substantial  current  flowing  on  it,  contributing 
to  the  total  radiated  field  along  the  horizon.  And  the  contribution  might  be  subtractive.  Several 
commercial  GPs  were  available  in  double-skirted  (DS)  versions  (Figure  2).  We  tested  one  and  saw 
definite  improvement  over  the  single-skirted  (SS)  designs.  The  MININEC  models  of  the  DSGP,  however 
produced  gains  within  .5  dB  of  the  SSGP.  The  MININEC  models  contained  only  the  upper  radials,  the 
quarter-wave  vertical  connecting  mast  below  the  upper  radials  and  the  lower  radials.  No  masts  nor 
coaxial  cables  were  used  in  any  of  the  computer  models  up  to  this  point. 

Efforts  were  then  concentrated  on  measurements  in  an  attempt  to  improve  performance  and 
especially  understand  what  was  causing  the  differences  and  disappointments.  During  the  measurements 
on  the  QWSSGP  it  was  observed  that  the  loss  in  gain  was  coming  from  uptilt  of  the  usually  broadside 
beam.  The  amount  of  uptilt  was  a  function  of  the  length  of  the  supporting  mast.  It  was  speculated  that 
the  amount  of  feed  line  that  was  "exposed"  (not  lying  on  the  ground)  and  could  contribute  to  the  pattern 
was  a  factor.  The  coax  dipole's  quarter-wave  sleeve  is  the  limiting  case  of  a  QWSSGP  with  drooping 


52 


radials  that  have  drooped  to  -90°.  Up  to  this  point,  all  experiments  were  conducted  with  supporting  pipes 
that  were  of  some  arbitrary  length,  extending  below  the  lower  radials.  A  coax  dipole  was  modified  by 
adding  a  set  of  4  wire  radials  at  the  bottom  of  the  outer  sleeve.  An  immediate  increase  in  gain  was  noted, 
along  -with  a  nasty  input  impedance  change.  The  second  set  of  radials  was  added  at  the  feed  point,  making 
the  antenna  electrically  very  close  to  the  QWDSGP.  Gain  comparisons  between  the  QWDSGP  and  the 
"radialized"  coax  dipole  showed  close  difference  of  less  than  a  dB. 

Now  it  was  obvious  that  the  coax  feed  line  outer  conductor  was  a  definite  contributor  to 
measurable  amounts  of  radiated  field  at  the  horizon.  And  the  MfNINEC  modeling  had  NOT  included  the 
cable  as  a  conductor  because  all  good  hams  know  that  "transmission  lines  don't  radiate".  To  test  this 
theory,  a  MININEC  model  of  a  half-wave  vertical  dipole  with  a  balun  connected  at  right  angles  to  the 
dipole  with  a  coaxial  cable  extending  a  quarter-wave  from  the  feed,  then  dropping  straight  down  toward 
the  ground  was  tested.  Figure  3  shows  the  geometry  of  the  conductors  and  the  currents  on  the  -wires.  The 
balun  was  not  electrically  connected  to  the  dipole.  A  small  gap  represented  the  balun  effect  of  preventing 
the  conduction  of  dipole  current  onto  the  outside  of  the  coax  outer  conductor.  The  calculated  currents  on 
the  balun  and  coax  shield  are  from  dipole  field  coupling.  Patterns  from  the  "real  world"  dipole  and  a  free 
space  dipole  are  presented  in  Figure  4. 

The  next  piece  of  enlightening  information  came  by  accident  when  some  mast  material  was  placed 
about  a  half  wave  from  an  experimental  radiator.  The  fields  near  the  radiator  were  "normal"  when  there 
was  no  mast  -within  several  wavelengths  of  the  radiator.  But  when  copper  tubing  was  at  the  half-wave 
location,  a  large  disturbance  was  noted.  If  a  piece  of  Radio  Shack  TV  mast  (either  their  light-duty  thin 
mast  or  the  robust  hea-vy-duty  pipe)  was  placed  in  the  same  location  as  the  copper  pipe  was,  the 
disturbance  was  almost  unmeasureable.  Many  other  types  of  metallic  pipe  were  tested,  including  both  "el 
cheapo"  magnetic  stainless  steel  tubing  and  costly  non-magnetic  stainless  steel  conductors.  The  results  of 
several  tests  show  that  magnetic  steels,  such  as  the  Radio  Shack  mast,  seem  to  be  very  lossy  for  RF 
currents  at  VHF.  For  example:  a  Japanese  magnetic  stainless  steel  whip  became  uncomfortably  warm  to 
the  touch  after  a  few  minutes  of  being  fed  with  50  watts  at  145  and  450  MHz. 

During  earlier  tests  on  the  SSGP  and  DSGP,  the  type  of  masts  (steel,  copper  pipe,  wooden  sticks) 
used  to  support  the  antennas  under  tests  was  not  thought  to  have  been  important.  After  the  magnetic  mast 
discovery,  the  breakthrough  in  showing  feed  line  effects  came  when  gain  was  proven  to  be  dependent  on 
the  spacing  between  the  support  mast  and  the  bottom  pipe  of  the  ground  plane  antennas.  Clamping  the 
GP  anterma  directly  to  the  different  masts  gave  somewhat  different  results,  but  an  experiment  using  a 
short  insulating  mast  between  the  GP  pipe  and  the  support  mast  provided  the  optimum  mounting 
arrangement.  A  top  end  of  a  piece  of  insulating  mast  material  was  clamped  to  the  bottom  of  the  GP 
antenna  and  a  quarter-wave  below  the  bottom  GP  radials,  a  Radio  Shack  TV  mast  was  clamped  to  the 
insulating  support.  This  provided  about  a  QW  spacing  below  the  antenna  of  exposed  feed  line  coax. 
Then  the  coax  was  fed  through  the  RS  mast  and  down  and  away  from  the  GP  antenna.  It  seemed  that  the 
lossy  nature  of  the  magnetic  mast  eliminated  RF  currents  that  would  have  run  down  the  outside  of  the 
feed  line  below  the  antenna.  With  the  RS  mast  arrangement,  the  full  expected  gain  of  the  DSGP  was 
realized. 

The  next  chapter  in  this  antenna  soap  opera  deals  with  the  ability  of  MININEC  to  predict  the 
coupling  between  the  SSGP  &  DSGP  antenna  elements  and  the  outer  conductor  of  the  coax  feed  line.  If 
the  fields  around  and  the  currents  on  the  elements  can  induce  currents  on  the  feed  line,  the  radiation 
patterns  should  show  this.  Indeed,  when  various  lengths  of  conductor  are  attached  to  the  basic  DSGP 
antenna  in  a  MININEC  model,  the  patterns  and  the  horizon  gain  does  vary  as  observed.  In  fact,  the  notion 
that  the  first  set  of  radials  is  all  that  is  needed  to  isolate  the  radiating  elements  from  the  feed  line,  is  shown 
to  be  completely  bogus.  A  MININEC  model  of  a  triple-skirted  GP  clearly  shows  substantial  currents  on 
the  lower  radials  and  feed  line!  Patterns  for  an  SSGP  and  a  DSGP  with  coax  stubs  hanging  down  from 
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the  bottom  of  the  antenna  are  shown  in  Figures  5  &  6.  The  patterns  for  the  original  ideal  GP  antennas  are 
overlaid  for  comparison.  The  SSGP  with  feed  line  included  shows  as  much  as  -  4  dB  gain  drop  along  the 
horizon,  while  the  DSGP  with  feed  line  suffers  -3.8  dB  horizon  loss!  This  confirms  the  earlier 
measurements  and  validates  MININEC  as  an  analysis  tool,  but  only  after  all  the  facts  are  in  so  a  complete 
numerical  model  can  be  developed. 

Thus,  in  terms  of  lessons  learned,  a  design  of  a  well-known  and  popular  simple  antenna  using 
MININEC  will  yield  exactly  the  expected  results.  If  the  modeler  verifies  the  performance  with  careful 
measurements,  and  finds  substantial  differences,  he  might  conclude  that  MININEC  lies  and  that  the 
generic  antenna  design  is  flawed.  But  if  there  is  real  intent  and  enthusiasm  to  discover  "Mother  Nature's" 
rules  for  such  antennas,  careful  experiments  and  an  inquisitive  mind  can  guide  one  from  the  fog  into  the 
sunlight  where  the  expected  results  can  be  achieved.  The  real  fun  is  discovering  what  Mother  Nature  has 
known  all  along  and  then  feeding  MININEC  "...the  rest  of  the  story".  Once  MININEC  has  the  correct 
conductor  lineup,  the  results  will  match  observation. 

Prologue:  The  original  design  of  all  the  GP  antennas  came  from  the  bare  bones  MININEC  model.  Once 
the  need  for  an  insulating  intermediate  mast  was  discovered,  an  addition  glitch  appeared.  Using  metal 
clamps  to  attach  the  bottom  pipe  of  the  GP  (between  the  radial  levels)  to  the  insulated  mast  seems  to 
detune  the  structure,  wrecking  the  input  impedance.  A  simple  fix  to  restore  the  impedance  consists  of 
wrapping  the  antenna's  pipe  with  tape,  then  clamping  to  the  taped  surface.  A  guess  is  that  the  clamp 
seems  to  increase  the  effective  diameter  of  the  pipe  and  that  changes  the  phase  difference  between  the  two 
sets  of  radials. 

MARS  STATION  OVERLOAD 

While  working  at  a  Navy  HF  receiving  site  in  Maine  during  the  summer  of  1995,  we  discovered  an 
intermittent  very  strong  local  single-side-band  (SSB)  voice  and  frequency-shift  keyed  (FSK)  radio 
teletype  (RTTY)  signal  at  4.001  MHz  several  times  each  day.  The  signal  overloaded  the  primary 
multicouplers  and  other  components  of  the  RF  distribution  system  of  the  site.  The  source  of  the  signal 
was  traced  by  LCDR  Andrew  Parker,  WVIB,  to  amateur  station  NC4B  operating  as  a  Military  Affiliate 
Radio  System  (MARS)  station  NNN0IBM.  The  station  was  directly  across  Prospect  Harbor  at  a  distance 
of  1.8  km  from  the  center  of  the  vertically  polarized  HF  receiving  array  as  shown  in  Figure  7, 

After  contacting  the  station  operator  and  explaining  the  nature  of  the  problem,  the  antenna  system 
was  examined.  (The  operator  was  a  retired  Na\'y  LCDR.  She  had  been  in  charge  of  base  housing  in  the 
old  12th  Naval  District  which  included  NFS  housing,  and  was  very  cooperative.)  The  antenna  was  a 
G5RV  "all-band  dipole"  at  25  feet,  which  was  oriented  in-line  with  the  receiving  site.  This  produced  a 
polarization  match  with  the  victim  antenna.  The  G5RV  was  used  in  the  Near  Vertical  Incidence  SkyWave 
(NVIS)  mode  for  a  local  Navy  MARS  net,  and  not  a  low  angle  long-haul  mode.  That  meant  that  non- 
directional  coverage  was  desired  and  reorienting  the  antenna  by  90°  would  not  affect  the  coverage,  but 
could  help  the  overload  situation.  In  order  to  minimize  the  interference  to  the  receiving  site,  it  would  have 
been  desirable  to  relocate  the  G5RV,  if  it  were  possible  to  evaluate  the  antenna-to-antenna  coupling  as  a 
function  of  position  of  the  MARS  antenna. 

It  was  a  stroke  of  good  luck  that  I  had  earlier  received  a  pre-release  copy  of  a  terrain  analysis 
computer  program  from  Brian  Beezley,  K6STI,  the  creator  of  MN  and  AO,  which  he  dubbed  TA.  Brian 
had  seen  the  results  of  a  measurement  program  where  the  radiated  fields  from  a  collection  of  dipoles  and 
monopoles  located  in  front  of,  on  top  of  and  behind  a  small  hill  in  central  Utah.  Jim  Breakall,  WA3FET, 
had  been  instrumental  in  having  a  version  of  the  MOM-BSC  GTD/UTD  code  adapted  to  model  the  terrain 
and  antennas.  The  results  of  the  measurements  and  numerical  modeling  tweeked  Brian’s  curiosity.  He 
studied  diffraction  theory  and  decided  to  try  to  write  a  simple  2-D  PC  code  to  evaluate  the  effects  of 
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terrain  in  front  of  a  ham  antenna  site.  Brian  asked  Breakall  &  I  to  participate  in  the  evaluation  of  TA.  We 
immediately  both  sa^v  its  value  for  our  students  for  visualizing  diffraction  effects.  TA  produces  terrain 
modified  radiation  patterns  from  precalculated  free  space  patterns  of  wire  antennas.  Multiple  reflections 
and  diffractions  and  combinations  of  both  are  supported.  User  defined  terrain  profiles  can  be  in  the  form 
of  data  files  or  from  mouse-generated  terrain  slopes.  The  upper  portion  of  the  screen  contains  a  polar 
radiation  pattern,  with  the  terrain  profile  at  the  bottom.  The  antenna  position  can  be  mouse-adjusted  "on- 
the-fly"  to  see  changes  in  the  pattern.  Changes  in  terrain  also  are  almost  instantaneous.  The  user  can  also 
turn  on  the  display  of  reflection,  diffraction  and  multiple  diffraction.  Other  tweaks  add  controls  that  are 
not  available  in  the  3-D  NEC-BSC  code.  We  were  pleased  to  see  that  TA  runs  for  the  Utah  measurements 
were  quite  close  to  those  of  the  NEC-BSC  based  code. 

The  terrain  between  the  MARS  station  and  the  site  consists  of  cliffs  at  both  ends  and  a  salt  water 
bay  between.  The  cliff  at  the  transmitter  was  30  feet  above  the  water  and  sloped  downward  for  135  feet. 
The  bay  was  3350  feet  across.  At  the  receiver  end  of  the  path,  the  terrain  rose  50  feet  over  a  distance  of 
1775  feet,  as  shown  if  Figure  8.  The  receiving  monopoles  were  80  feet  high.  TA  was  used  to  locate  the 
G5RV  at  several  heights  above  the  ground  and  at  various  distances  from  the  edge  of  the  cliff  in  back  of 
the  house.  The  low  angle  calculated  gain  was  determined  in  the  direction  of  the  receiving  site  for  the 
original  geometry  and  for  several  new  locations  that  was  compatible  with  the  back  yard  tree  locations. 
Figure  9  is  a  sample  of  one  of  the  runs. 

Prologue:  A  height  of  20  feet  was  chosen  for  the  new  location  of  the  dipole,  using  a  pair  of  trees  on 
opposite  sides  of  the  back  yard.  The  impedance  of  the  G5RV  remained  acceptable.  As  soon  as  the 
relocation  was  completed,  the  NVIS  performance  was  checked  with  several  of  the  MARS  net  members. 
They  all  reported  and  increased  signal  with  the  new  orientation.  A  phone  call  was  then  made  to  the 
receiving  site  where  Ray  Vincent,  W6PUX  and  George  Munsch,  W5VPQ  were  waiting  to  measure  the 
new  signal  level.  The  relocation  and  reorientation  produced  a  20  dB  drop  in  signal  level  at  the  antenna 
terminals,  enough  to  eliminate  any  trace  of  overload.  The  ray  tracing  display  of  TA  showed  that  the  new 
location  eliminated  the  bounce  off  the  bay  water  surface,  with  only  cliff-edge  diffractions  adding  to  the 
direct  rays.  TA  has  now  become  part  of  our  standard  software  suite  we  use  on  laptop  PCs  during 
worldwide  field  trips  to  receiver  sites.  Its  quick  solution  time  and  revealing  visualization  of  rays  make  it 
superior  to  NEC-BSC  for  field  use. 

The  capability  of  mainframe-based  batch-mode  NEC-MoM  and  NEC-BSC  codes  includes  solving 
problems  like  these.  But  the  user-friendly  amateur  radio-based  codes  with  their  graphical  interface  and 
superior  output  displays  are  preferred  over  their  "big  brothers"  for  field-expedient  solutions  to  these  real- 
world  problems.  In  addition,  these  Ham  CEM  codes  easily  fit  on  PC  laptops  that  are  readily  available  in 
the  field. 
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FIG.  2  DOUBLE-SKIRTED 
GROUND  PLANE 
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FIG.  3  HALF-WAVE  VERTICAL  DIPOLE,  WITH  BALUN  AND  COAX  LINE,  SHOWNING 
DIPOLE  CURRENT  AND  INDUCED  CURRENTS  ON  BALUN  AND  COAX 


UERT.  DIPOLE/COflX  i  BftLUN  0  dB  E2NEC  1.0 


FIG.  4  PATTERNS  FOR  A  VERTICAL  DIPOLE  WITH  AND  WITHOUT  BALUN  AND  COAX 


Outer  Ring  =  3.00  dBi  Elevation  Plot 

Max.  Gain  =  2.54  dBi  fl2inuth  Angle  =  0.0  Deg 

FIG.  5  SINGLE-SKIRTED  GROUND  PLANE  WITH  AND  WITHOUT  MAST/COAX  STUBS 


DSGP  NO  SUPPORT  MOST  .  0  dB  E2NEC  1.0 


FIG.  6  DOUBLE-SKIRTED  GROUND  PLANE  WITH  AND  WITHOUT  MAST/COAX  STUBS 


FIG.  9  LOW  ANGLE  RADIATION  FROM  A  G5RV  DIPOLE  IN  THE  DIRECTION  OF 


THE  RECEIVING  SITE  FOR  16’  AND  40’  DIPOLE  HEIGHT 
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Development  of  Practical  Landstorfer  Antennas  for  Amateur  Use 
Marc  Tarplee 

Landstorfer  and  Sacher,  in  1985,  considered  how  the  performance  of  electrically  “long”  antennas  (total  length 
greater  than  Xll  )  could  be  optimized.  Their  approach  was  to  divide  the  antenna  into  a  large  number  of  small 
segments,  whose  positions  were  perturbed  to  optimize  some  aspect  of  the  antenna’s  performance.  They  looked 
at  ways  to  increase  the  effective  height  of  monopoles  and  ways  to  increase  the  directivity  of  dipoles.  Landstorfer 
and  Sacher  also  discussed  how  their  optimized  antennas  could  be  combined  into  Yagi-Uda  arrays  and  log- 
periodic  arrays.  They  tested  models  of  Yagis  and  LPDA’s  made  from  their  curvilinear  (bent)  dipoles  that  showed 
considerable  promise.  They  did  little  optimization  of  their  designs  due  to  the  limited  performance  of  the 
computers  available  to  them.  [1] 

Since  his  original  work,  few,  if  any,  amateur  antennas  have  been  constructed  using  the  principles  outlined  in  [1]. 
There  are  probably  several  reasons  for  this.  First,  not  many  amateurs  were  familiar  with  Landstorfer’s  and 
Sacher's  work.  Second,  computer  software  capable  of  performing  the  required  optimizations  was  not  available  to 
amateur  radio  operators  until  recently.  Third,  Landstorfer’s  and  Sacher’s  original  designs  are  not  easy  to 
fabricate. 

In  this  paper,  I  will  look  at  three  issues  surrounding  the  Landstorfer  antenna; 

1.  What  are  the  properties  of  a  "good”  approximation  to  Landstorfer’s  curvilinear  dipoles  which  is  simple 
to  fabricate  ?  I  will  try  to  answer  questions  such  as;  How  long  should  the  dipole  be  ?  How  many 
segments  should  it  contain?  What  is  the  driving  point  impedance? 

2.  How  can  the  Landstorfer  dipole  be  used  by  amateurs? 

3.  How  can  Landstorfer  dipoles  be  used  in  arrays? 

All  antenna  analysis/optimization  required  for  this  paper  will  be  done  with  AO  Antenna  Optimizer  version  6.36 
and  NECAA/ires  version  1.62.  Prior  to  entering  a  discussion  of  the  three  issues  outlined  above,  it  is  worthwhile  to 
use  AO  and  NECAA/ires  to  reproduce  Landstorfer’s  results  given  in  [1]  as  a  starting  point. 

Review  of  Landstorfer’s  original  work 

Landstorfer  [1]  explored  in  some  detail,  the  properties  of  a  3X/4  monopole  bent  in  order  to  optimize  directivity. 
Below  is  a  comparison  of  Landstorfer’s  results  and  my  results  using  AO 

Table  1  Comparison  of  calculated  shapes  of  bent  monopoles  optimized  for  directivity 


segment  angles  (deg) _ Directivity  Input  Impedance 


N 

a1 

a2 

a3 

a3 

a3 

a3 

D  (dbi) 

Zin  (Q) 

1 

2 

6 

31  (34) 
43 (48) 
33  (35) 

8(14) 

53 (57) 

40  (51) 

12(22) 

2(10) 

17(10) 

5.1  (5.9) 
6.9  (6.8) 

7.1  (6.9) 

85  (53+j11) 

83  (50+j4) 

86  (55+j15) 

Notes; 

Boldface  data  are  results  from  AO;  non-boldface  are  Landstorfer’s  [1] 

AO  was  set  to  1 0  segments  per  half  wave  with  automatic  tapering  enabled 
The  antennas  optimized  by  AO  had  a  diameter  equal  to  X/1000. 

All  segments  were  the  same  length  i.e.  3W4N 

Directivity  optimization  in  AO  was  done  by  modeling  equivalent  dipole 
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The  agreement  between  Landstorfer's  values  and  mine  is  fairly  good.  Landstorfer  assumed  that  the  current 
distribution  in  each  segment  is  sinusoidal  which  would  lead  to  slightly  different  results.  As  a  further  check,  the 
directivities  and  input  impedances  were  computed  again  using  NecWires.  The  results  appear  in  Table  2  below: 

Table  2  Comparison  of  AO  and  NW  Results 


Results  from  AO 

Results 

NecWires 

from 

N 

Directivity  (dbi) 

Zin  (D) 

Directivity  (dBi) 

Zin  (O) 

1 

5.9 

106+j22 

5.9 

105+j7 

2 

6.8 

99+j8 

6.7 

102-j10 

6 

6.9 

109+j29 

7.0 

106+j8 

There  is  good  agreement  between  the  two  programs. 


Basic  Properties  of  the  Landstorfer  dipole 
a)  Number  of  Segments  and  directivity: 

A  1 .5X  Landstorfer  dipole  was  optimized  for  directivity  using  various  numbers  of  segments.  The  results  appear  in 
Figure  1  below: 

FIGURE  1 


Directivity  of  a  1 .50  wavelength  Landstorfer  dipole 


Number  of  Segments,  N  »> 


It  is  obvious  from  Fig  1  that  one  does  not  need  to  use  a  large  number  of  segments  to  realize  most  of  the 
directivity  available  from  this  antenna.  In  fact,  2  segments  would  be  sufficient;  one  gains  less  than  0.1  dB  of 
additional  directivity  by  increasing  to  12  segments.  Note  that  the  2  segment  antenna  has  a  moderate  advantage 
over  a  single  segment  antenna  (which  is  really  a  resonant  vee  antenna).  This  simplifies  the  construction  of 
Landstorfer  curvilinear  elements:  one  needs  only  one  bend  at  the  middle  of  each  half  element,  as  opposed  to  the 
continuously  curved  elements  shown  in  [1] 
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b)  Antenna  length  and  directivity 


For  a  straight  dipole,  the  directivity  increases  to  a  maximum  at  1  s  1.28?.  and  decreases  for  longer  antennas  [2], 
If  one  wishes  to  use  a  dipole  as  an  element  of  a  Yagi  array  or  LPDA,  it  is  not  just  the  overall  directivity  that  is 
important,  but  the  directivity  in  the  direction  perpendicular  to  the  axis  of  the  antenna  (broadside  directivity, 
denoted  by  DO)  that  is  important.  DO  values  for  Landstorfer  dipoles  are  much  greater  than  those  of 
corresponding  straight  wire  antennas.  Figure  2  below  shows  this  variation  with  overall  antenna  length: 

FIGURE  2 


Directivity  of  Various  Types 
of  dipoles  as  a  function  of  length 


7 


length  (wavelengths)  »> 


Note  that  the  directivity  of  a  Landstorfer  dipole  quickly  increases  to  4.9  dBd  when  the  length  reaches  1 .501  and 
remains  almost  constant  to  nearly  2.001.  The  next  plateau  comes  at  2.501,  where  the  directivity  is  approximately 
6.7  dBd.  It  is  not  known  if  the  directivity  continues  to  increase  for  lengths  beyond  3.001.  It  appears  that  a  length 
of  1.501  seems  to  be  a  good  compromise;  doubling  the  length  to  3.001  would  add  less  than  2  dB  to  the 
directivity  and  there  is  no  reason  to  believe  that  at  some  larger  length  that  there  would  be  a  sudden  increase  in 
directivity. 

c)  Input  Impedance 

Figure  3  shows  how  the  input  impedance  of  a  Landstorfer  dipole  varies  with  length  in  the  range  0.501  <  1  < 
3.001.  Resonance  occurs  at  I  =  1.471  and  2.501.  and  there  is  an  anti  resonance  at  0.841.  The  input  impedance 
at  resonance  is  on  the  order  of  100  ohms,  so  the  antenna  could  be  fed  directly  with  75  ohm  coax,  or  matched  to 
50  ohm  coax  with  a  75  ohm  quarter-wave  matching  section. 

d)  SWR  Bandwidth 

Figure  4  compares  the  bandwidth  of  a  half  wave  dipole  and  a  Landstorfer  dipole  of  similar  diameter  (1/110).  The 
Landstorfer  dipole  has  a  narrower  bandwidth,  which  is  not  a  complete  surprise.  Other  authors  have  noted  that 
resonant  vee  and  long  wire  antennas  have  impedances  that  change  quite  rapidly  with  frequency.  The 
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explanation  is  that  a  fixed  percentage  change  in  a  long  antenna  is  much  greater  (in  terms  of  a  wavelength)  than 
a  short  antenna  [3] 

Although  the  Landstorfer  antenna  is  relatively  narrow-band,  it  still  has  sufficient  bandwidth  to  cover  the  entire  2 
meter  band  when  its  diameter  is  relatively  large  (A,/100).  If  this  antenna  were  scaled  to  14.175  MHz  and  made 
from  #12  wire  (dia  =  WIODOO)  the  antenna  would  still  have  a  2:1  SWR  bandwidth  of  about  470  kHz.  enough  to 
cover  the  20  meter  band. 


FIGURE  3 

Impedance  of  a  Landstorfer  dipole  as  a  function  of  length 


800 


Re(Z)  j 
lm(Z)  I 


FIGURE  4 


Calculated  SWR  Bandwidth 
of  a  Landstorfer  Dipole 
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Uses  of  a  single  Landstorfer  Dipole 


a)  Horizontally  polarized  gain  antenna  for  HF 

A  1.5X  Landstorfer  dipole  has  a  directivity  similar  to  a  2  element  Yagi  (approximately  5  dBd),but  is  triple  the 
length  of  a  half  wave  dipole,  which  limits  its  use  on  HF.  (I  cannot  imagine  anyone  erecting  a  160  meter 
Landstorfer  dipole;  it  would  be  over  800  feet  long)  it  would  require  three  supports,  unlike  a  dipole  which  requires 
two,  or  an  inverted  vee,  which  requires  one.  Application  on  HF  would  probably  limited  to  10  and  12  meters. 

b)  ground-mounted  HF  monopole 

A  2  segment  0.75;i  Landstorfer  monopole  is  about  0.6X.  high.  This  certainly  makes  it  possible  to  erect  this  type 
of  antenna  on  the  10  through  20  meter  bands.  Table  3  compares  the  performance  of  2  common  types  of 
vertical  antennas  ant  the  Landstorfer  monopole: 

TABLE  3 


TYPE  OF  ANTENNA 

MAX 

(dBd) 

RADIATION  ELEVATION  ANGLE 
OF  MAX  (») 

EFFICIENCY  (15  n 
GROUND  LOSS) 

0.25?.  monopole 

-3.96 

26 

60% 

0.625X,  monopole 

-1.30 

16 

80% 

0.75?.  Landstorfer 

-0.84 

16 

77% 

Looking  only  at  this  table,  one  would  not  see  much  of  a  difference  in  performance  between  a  5?J8  monopole  and 
the  Landstorfer  monopole.  These  numbers  are  not  the  whole  story,  however.  The  radiation  pattern  of  the  5?J8 
monopole  has  more  unwanted  high  angle  radiation  than  the  Landstorfer  antenna 

c)  VHF  antennas 

The  electrically  “large”  dimensions  of  the  Landstorfer  antenna  are  not  problematic  on  the  VHF  bands.  A  6  meter 
Landstorfer  dipole  made  from  aluminum  tubing  is  about  the  size  of  the  reflector  on  a  triband  HF  Yagi.  1  have 
constructed  a  2  meter  Landstorfer  dipole  which  is  mounted  vertically  and  used  to  work  local  repeaters.  The 
antenna  measures  about  8  feet  across.  While  it  is  certainly  larger  than  a  2  meter  ground  plane  or  dipole,  it  is  still 
manageable  and  offers  some  gain. 

A  6  meter  Landstorfer  monopole  should  be  a  good  DX  antenna  with  its  gain  and  low  angle  of  radiation.  I  am 
constructing  one  which  will  be  ground  mounted  and  fed  against  a  28  ft  by  48  ft  counterpoise  containing  40  radial 
wires.  It  will  be  tested  during  the  spring  of  1996. 

The  Landstorfer  monopole  can  also  be  used  as  part  of  a  ground  plane  antenna.  The  author  compared  quarter 
wave,  5/8  wave  and  3/4  wave  Landstorfer  ground  plane  antennas  using  AO.  The  ground  plane  consisted  of  4 
wires  spaced  at  90  degree  intervals,  parallel  to  the  ground.  The  Landstorfer  design  performed  better  than  either 
straight  antenna.  It  had  about  2.5  dB  more  gain  than  the  quarter  wave  antenna  and  2  dB  more  than  the  5/8  wave 
design.  The  optimal  radial  length  is  about  0.35>.  and  the  input  impedance  is  around  50+j30  Q. 

Arrays  of  Landstorfer  Dipoles 

a)  Log  Periodic  Arrays 

Anyone  interested  in  LPDA’s  made  from  Landstorfer  dipoles  should  review  chapter  5  of  [1].  Landstorfer  and 
Sacher  constructed  a  series  of  UHF  LPDA’s  and  measured  their  properties.  Their  6  element  470  -  610  MHz 
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design  had  a  directivity  of  10  dBd  with  sidelobe  attenuation  of  at  least  17  dB.  It  is  not  possible  to  get  this  kind  of 
performance  from  a  standard  LPDA  with  the  same  number  of  elements  and  same  boom  length. 


b)  Parasitic  (Yagi-Uda)  Arrays 

The  high  directivity  of  the  Landstorfer  dipole  makes  it  the  ideal  building  block  for  a  parasitic  array.  I  examined 
Yagis  having  2  to  6  elements.,  I  attempted  to  optimize  each  Yagi  for  improved  directivity  and  F/B  ratio.  The 
antennas  had  resonant  drivers  (  length  =  0.738X)  and  element  diameters  of  V1 10.  I  used  AO’s  optimization 
routine,  with  the  following  weighted  constraints: 


Gain 

40% 

no  constraint 

F/B 

40% 

F/B  >  20  dB 

drive  resistance 

20% 

Re(Z)  >  20  ohms 

2  element  Yagis 


Below  is  an  optimized  2  element  design: 


Reflector  Length 
Driver  Length 
Element  Diameter 
Element  Separation 
Directivity 
Front/Back  Ratio 
Input  impedance 


0.743X 
0.71  U 
V110 
0.122^ 

8.3  dBd 
28  dB 
45+j5ohms 


Lawson  [4]  states  that  the  maximum  directivity  of  a  standard  2  element  Yagi  is  6.70  dBi  or  4.6  dBd.  The 
Landstorfer  Yagi  shows  more  than  3  dB  improvement ,  which  is  equivalent  to  doubling  one’s  transmitter  power 


I  constructed  a  146  MHz  version  of  this  2  element  Yagi  out  of  7/8  inch  OD  aluminum  tubing.  I  had  no  way  to 
measure  the  impedance  of  the  antenna,  but  the  SWR  at  resonance  was  1.2,  which  was  consistent  with  the 
model.  I  do  not  have  access  to  an  antenna  range,  so  I  had  no  way  to  verify  either  the  F/B  or  gain  figures 
produced  by  the  model.  I  did  mount  the  antenna  at  a  height  of  8  feet  in  by  back  yard  and  found  that  its 
performance  was  nearly  equal  to  a  5  element  quad  array  similarly  mounted. 


3  element  Yagis 

I  have  chosen  to  compare  a  “good”  3  element  Landstorfer  Yagi  against  an  8  element  design  found  in  [4].  The  3 
element  design  compares  quite  favorably  against  Lawson's  8  element  Yagi  in  all  areas. 


TABLE  5 


PROPERTY 

3-EL  LANDSTORFER  YAGI 

8  EL  LAWSON  YAGI  [4] 

Reflector  Length 

0,734/. 

0.4950?. 

Driver  Length 

0.725?. 

0.4726?. 

Director  Length 

0,703?. 

0.4240?.  (all  6  directors) 

Element  Diameter 

V110 

V470 

Element  Separations 

0.147?. 

.184?. 

Boom  Length 

0.294X 

1288?. 

Directivity  (dBd) 

10.0 

9.75 

Front/Back  Ratio  (dB) 

23 

29.1 

Input  Impedance  (ohms) 

17+jO 

34+jO 
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Yagis  with  4  or  more  elements 

i  have  optimized  4.  5  and  6  element  Landstorfer  Yagi  designs.  { I  stopped  after  6  because  AO  would  not  handle 
any  more  symbols).  Figure  5  compares  the  gain  of  Landstorfer  and  conventional  Yagi  arrays.  Data  for 
conventional  Yagis  came  from  [4],  Although  the  Landstorfer  Yagi  offers  approximately  3  dB  more  gain  than  a 
conventional  Yagi  of  similar  boom  length,  one  does  not  get  something  for  nothing;  the  Landstorfer  array  has 
three  times  the  width. 

FIGURE  5: 


GAIN  COMPARISON;  LANDSTORFER  VS  DIPOLE  YAGI 
ARRAYS 


Summary 


Landstorfer  antennas  have  not  seen  much  application  in  amateur  circles.  The  computer  studies  that  i  have 
undertaken  and  the  limited  experiments  that  I  have  conducted  indicate  that  this  antenna  has  potential.  The 
possibility  of  high  directivity  on  a  short  boom  with  few  elements  is  very  appealing. 

Other  areas  that  could  yield  interesting  results  are  the  optimization  of  a  Landstorfer  monopoie  to  maximize 
radiation  at  a  specific  elevation  angle  and  the  application  of  Landstorfer's  ideas  to  traveling  wave  antennas  such 
as  rhombics. 

Hams  like  to  experiment  with  antennas  and  I  hope  that  this  article  will  spur  them  to  begin  experimentation  with  a 
new  type  of  antenna. 
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Abstract  -  The  Numerical  Electromagnetics 
Code  (NEC)  provides,  by  means  of  an  “NT”  data 
file  entry,  for  a  two-port  nonradiating  network  to 
be  connected  between  any  two  segments  in  a  wire 
antenna  structure.  For  NEC,  the  electrical  be¬ 
havior  of  such  a  network  is  specified  by  an  admit¬ 
tance  matrix.  Many  wire  antenna  types  employ 
baluns  in  their  actual  implementation.  This  pa¬ 
per  describes  a  method  for  the  characterization 
of  baluns  which  allows  their  inclusion,  as  tw'o- 
port  netw'orks,  in  NEC  modeling  and  analysis  on 
a  routine  basis.  Baiun  characterizations  from  the 
procedure  described  here  are  also  useful  in  other 
practical  applications, 

I.  Introduction 

Actual  HF/VHF  ware  antennas  often  employ  at 
least  one  balun  for  the  purpose  of  converting  an 
unbalanced  coaxial  cable  input  into  a  balanced 
output.  These  baluns  vary  widely  in  their  electri¬ 
cal  characteristics,  depending  on  the  balun  type 
and  its  construction,  the  operating  frequency, 
and  load  impedance  (i.e.,  nature  and  extent  of 
mismatch).  The  most  common  nominal  balun 
impedance  transformation  ratios  (output  to  in¬ 
put)  are  1:1,  4:1,  and  9:1,  and  baluns  operat¬ 
ing  in  close  proximity  to  their  design  impedances 
are  generally  well-behaved,  in  the  sense  that 
they  reasonably  approximate  ideal  transformers 
in  that  case.  However,  practical  communicators 
quickly  learn  from  experience  that  real  baluns  can 
radically  deviate  from  ideal  transformer  behavior 
when  the  load  impedance  varies  from  the  design 
value.  Under  mismatched  operating  conditions, 
the  matrix  elements  describing  a  particular  balun 


are  not  available  from  ‘spec  sheets’  or  reference 
handbooks.  The  only  recourse  for  reliable  deter¬ 
mination  of  the  matrix  elements  is  from  actual 
measurements  on  individual  baluns. 

Nowadays,  the  instrument  of  preference  for 
network  characterization  is  the  vector  network 
analyzer.  If  one  does  not  own  a  network  ana¬ 
lyzer,  such  an  instrument  is  generally  accessible 
at  a  nearby  university  or  research  laboratory,  so  it 
is  fair  to  say  that  vector  network  analyzers  for  the 
HF  (3-30  MHz)  and  VHF  (30-300  MHz)  bands 
are  now  generally  available.  Vector  network  an¬ 
alyzers  have  two  ports  for  the  connection  of  a 
Device  Under  Test  (DUT),  and  can  measure  the 
four  scattering  parameters  Sn,  5i2, 52i,  and  S'22, 
w'hich  are  four  complex  quantities  as  described  in 
[1]  and  elsewhere.  All  linear  two-port  networks, 
including  baluns,  can  be  characterized  fully  (at  a 
given  frequency)  by  a  2  x  2  matrix  of  complex  pa¬ 
rameters,  but  the  matrix  type  is  not  unique  and 
numerous  useful  descriptors  may  be  defined  .  For 
example,  the  netw^ork  may  be  described  by  a  scat¬ 
tering  matrix,  5,  transmission  or  chain  matrix, 
T,  impedance  matrix,  Z,  or  admittance  matrix, 
Y-  There  are  straightfori-vard  conversion  formu¬ 
las  among  the  different  matrix  types  so  that,  for 
example,  admittance  element  values  required  by 
NEC  can  be  readily  calculated  from  scattering 
matrix  values  [2] . 

Because  the  network  analyzer  ports  are  both 
50f2  unbalanced,  whereas  one  port  of  the  balun 
is  balanced  and  usually  designed  for  a  different 
impedance,  one  cannot  obtain  a  reliable  measure¬ 
ment  of  scattering  parameters  with  the  balun  di¬ 
rectly  connected  between  the  two  analyzer  ports. 
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This  paper  describes  a  method  which  overcomes 
this  problem.  The  four  s-parameters  which  com¬ 
pletely  characterize  the  balun  electrically  at  a 
given  operating  frequency  are  determined  from 
three  one-port  {Sn)  measurement  values  with  the 
unbalanced  balun  port  connected  to  the  network 
analyzer  Port  1,  and  the  balanced  output  of  the 
balun  terminated  in  specified  loads. 


II.  Theoretical  Basis  for  Procedure 


The  complex  wave  amplitudes  |a)  = 


ai 

02 


and 


\b)  = 


associated  with  a  linear  two-port 


network,  as  depicted  in  Figure  1,  are  related 
through  the  matrix  equation 


ifi)  =5  |a) 


(1) 


where 


5'= 


511  5i2 
521  522 


(2) 


In  the  context  of  the  present  problem,  Port  1  rep¬ 
resents  the  unbalanced  balun  input  and  Port  2 
represents  the  balanced  output,  which  is  avail¬ 
able  for  load  termination  changes. 


Fig.  1.  Generic  two-port  network. 


Network 

Analyzer 

HP8505A 

1  ^  Port  2 


Fig,  2.  Measurement  setup. 


One  of  the  two  linear  equations  contained  in 
Eq.  1  is 

bi  =  5iiai  +  5i202-  (3) 

If  Port  2  is  terminated  in  a  matched  load,  i.e.,  the 
balun  balanced  output  is  terminated  in  its  design 
resistance,  02  =  0  and  61  =  5iiai,  or 

Sn  =  -  =  r„,  (4) 

Port  2  matched 

where  Pm  denotes  the  value  of  5ii  under  a 
matched-load  condition. 

If  Port  2  is  now  terminated  in  a  short  circuit, 
a2  =  —62  and  the  equation  pair  embodied  in  Eq. 
1  becomes 

bi  =  5iiai  +  512^2  =  Pm^^i  +  5i2<l2  (5) 

and 

(&2  =)  -a2  =  Suai  +  S22Q'2-  (6) 

Similarly,  termination  of  Port  2  in  an  open  circuit 
causes  02  =  +62,  so  Eq.  5  remains  valid,  but  now 

(62  =)  -I-  02  =  5i20i  -f  52202-  (7) 


To  begin,  assume  the  balun  is  reciprocal,  an 
assumption  which  is  substantiated  by  measure¬ 
ments.  This  immediately  gives  5i2  =  52i  and 
reduces  the  number  of  outstanding  unknowns  to 
three. 


There  are  at  least  two  ways  to  arrive  at  an 
expression  for  5i2.  In  the  first  approach,  Eq.  5 
divided  by  oi  gives  the  short-circuited  condition 
reflection  coefficient 

-I  -4-512-  (8) 

O-l  I  Port  2  shorted  Oi 
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which  re-arranges  to 

a2 

0-1  Si2 

Similarly,  division  of  Eq.  6  by 
re-arrangement,  yields 

^  —  *^12 
ai  S'22  + 1 


(9) 

followed  by 
(10) 


Equating  the  Right-Hand  Side  (RHS)  expressions 
of  Eq.  9  and  10  produces  the  first  result 


S!2  =  {S22  +  l)iTm-Tsl  (11) 


where  we  take  the  angle  of  the  complex  RHS  re¬ 
sult  to  be  restricted  to  —180°  <  ©  <  +180°  so 
that  5i2  is  single-valued. 

A  second  approach  is  to  first  obtain  an  expres¬ 
sion  for  the  open-circuited  reflection  coefficient 
through  division  of  Eq.  5  by  oj : 


0-1  Port  2  open 


=  r,  =  r„  +  Si2^, 

ai 


which  re-arranges  to 


(12) 


O2  _  Eq  EfTi 

01  Si2 


(13) 


Division  of  Eq.  7  by  02,  followed  by  re¬ 
arrangement,  gives 


02  S12 

ai  1  —  S'22 


(14) 


Equation  of  the  RHS  expressions  of  Eq.  13  and 
14  produces  the  second  result 


5?2  =  (ro-r„)(i-S22).  (15) 


There  is  no  reason  to  prefer  Eq.  11  over  Eq. 
15,  or  vice  versa,  as  a  determination  of  the  value 
of  S12  from  measured  data.  As  one  might  expect, 
the  two  results  are  very  close  in  value,  if  not  iden¬ 
tical.  In  practice,  then,  it  is  reasonable  to  average 
the  two  results. 

Finally,  if  one  equates  the  RHS  expressions  of 
Eq.  11  and  Eq.  15,  and  algebraically  solves  for 
S'22,  the  following  expression  is  obtained; 

o  Eo  +  Fs— 2rm 

r„-r.  ■ 


At  an  operating  frequency  of  interest,  5ii  is  mea¬ 
sured  for  a  given  balun  terminated  in  three  dif¬ 
ferent  loads:  matched,  open-circuit,  and  short- 
circuit.  The  numerical  values  obtained  by  mea¬ 
surement  are  denoted  TmiEo,  and  Fs,  respec¬ 
tively.  A  straightforward  pc  computer  program 
then  applies  Equations  4,  11,  15,  and  16  to  cal¬ 
culate  5'ii,5i2,  and  S22  for  the  balun  under  test. 
S21  =  S12  by  reciprocity,  which  completes  the  S 

matrix  characterization  of  the  balun. 

Admittance  parameters  are  required  for  the 
NEC  data  file.  The  translation  from  S  to  T, 
again  by  computer,  is  accomplished  through  the 
conversion  relation 


El  E2 
E3  EA 


(17) 


where 


^  {l-Sn){l-S22)-SnS2i 

(17a) 

(1  +  522)(1  -  ^ii)  +  5i2S'21 

ft, 

(17b) 

25'i2 

^/RclRc2 

(17c) 

E. 

\/RciRc2  ’ 

(17d) 

C  (1  +  Sii)(l  -  522)  +  5'i2521 
ft2 

(17c) 

In  the  above  equations,  Rci  and  Rc2  denote  the 
nominal  characteristic  impedances  at  Port  1  and 
Port  2  of  the  two-port  balun.  For  example, 
Rci  =  50D  and  Rc2  =  450^  could  be  taken  as 
appropriate  values  for  a  9:1  balun. 


IV.  Another  Practical  Application 

With  the  matrix  S  in  hand,  another  useful  ap¬ 
plication  is  readily  available.  Namely,  one  can 
predict  the  input  reflection  coefficient  Ej,  look¬ 
ing  into  Port  1  of  the  balun,  when  the  output 
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Port  2  is  terminated  in  an  arbitrary,  but  known, 
impedance  Zi,. 

The  chain  or  transmission  matrix, 


A  B 
C  D 


is  also  well  known  as  the  ABCD  ma¬ 


trix.  For  computer-based  analysis  of  cascaded 
two-ports,  an  ABCD  matrix  description  is  ad¬ 
vantageous  (as  discussed  in  reference  [2]).  The 
conversion  from  5  f o  T  is  via 


Ai  A2 
A3  A4 


(18) 


where 


A  = 


2521  ’ 


(18a) 


Rci, 


Ai  =  J  [(1  -I-  5ii)(l  -  S22)  +  5i252i]  , 

^ _ L  (18b) 

A2  =  \/Rc\Rc2  [(1  +  5ii)(l  -f  S22)  -  5i252i]  , 

(18c) 

^3  -  /r}  -rr-  1(1  ~  ‘5'n)(l  -  522)  -  5i252i]  , 


(18d) 


and 


A4  =  [(1  -  5ii)(l  -f  522)  +  5i252i]  . 

(18e) 

Nominal  port  characteristic  impedances  Rd  and 
Rc2  are  the  same  as  in  the  conversion  to  admit¬ 
tance  parameters. 

Take  the  system  impedance  to  be  Zq  ==  50f2 
and  assume  the  load  impedance  on  the  balanced 
output  side  of  the  balun  is  Z^.  The  calculated,  or 
predicted,  input  impedance  (as  seen  looking  into 
Port  1  of  the  balun)  is  then 


AZl  +  B 
*  CZl^D 


(19) 


and 


Zj  —  Zq 
Zi  +  Zq 


(20) 


The  capability  to  predict  input  impedance  and 
mismatch  for  a  balun  with  an  arbitrary  complex 
termination  is  of  considerable  interest  to  rf  engi¬ 
neers  concerned  with  practical  radio  communica¬ 
tion  systems  at  HF  and/or  VHF  frequencies. 


A.  Example  of  Results 

A  commercially  available  4:1  HF  balun  was  char¬ 
acterized  by  the  procedure  described  above.  Be¬ 
cause  the  S  matrix  determination  is  made  at  a 

specific  operating  frequency,  one  may  ask  how 
many  discrete  characterizations  are  required,  as 
a  practical  matter,  to  cover  the  full  3-30  MHz 
HF  band.  Table  I  below  shows  measured  5u  un¬ 
der  the  three  conditions  required  for  character¬ 
ization  for  this  example  case,  and  is  representa¬ 
tive  of  the  frequency  sensitivity  of  typical  baluns. 
The  table  contents  suggest  that  linear  interpola¬ 
tion  between  values  at  discrete  1  MHz  intervals 
is  acceptable  (except  possibly  below  5  MHz). 


Table  I.  Measured  Sn  (in  dB,  degrees)  | 

Freq.  (MHz) 

Short 

Open 

200n  Load 

3 

0.0/172 

0.0/54 

-10.3/142 

4 

0.0/169 

0.0/42 

-11.0/142 

5 

0.0/166 

0.0/32 

-11.4/142 

6 

0.0/162 

0.0/25 

-11.6/141 

7 

0.0/158 

0.0/20 

-11.7/140 

8 

0.0/155 

0.0/15 

-11.7/138 

9 

0.0/152 

0.0/12 

-11.6/136 

10 

0.0/148 

0.0/8 

-11.5/133 

11 

0.0/145 

-0.1/6 

-11.4/131 

12 

-0.1/142 

-0.1/4 

-11.3/129 

13 

-0.1/139 

-0.2/2 

-11.2/127 

14 

-0.1/136 

-0.2/0 

-11.0/125 

15 

-0.2/132 

-0.2/0 

-10.8/122 

16 

-0.2/130 

-0.3/  -  2 

-10.7/120 

17 

-0.3/127 

-0.3/ -3 

-10.5/118 

18 

-0.4/124 

-0.3/ -5 

-10.3/116 

19 

-0.4/122 

-0.3/ -6 

-10.2/114 

20 

-0.5/119 

-0.3/  -  7 

-10.0/113 

21 

-0.5/117 

-0.3/ -9 

-9-8/111 

22 

-0.4/115 

-0.2/  -  10 

-9.5/110 

23 

-0.4/113 

-0.2/  -  11 

-9.2/108 

24 

-0.3/110 

-0.1/  -13 

-8.9/106 

25 

-0.3/108 

-0.1/  -15 

-8.8/104 

26 

-0.3/105 

-0.1/  -  17 

-8.6/102 

27 

-0.3/104 

-0.1/  -  18 

-8.4/100 

28 

-0.3/101 

-0.1/  -  20 

-8.2/99 

29 

-0.3/99 

-0.1/  -21 

-8.1/97 

30 

-0.3/97 

-0.1/  -23 

-7.9/96 
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For  purposes  of  illustration,  the  input  reflection 
coefficient  predicted  by  the  subject  characteriza¬ 
tion  method  is  compared  to  measured  data,  for 
three  widely  different  complex  loads  at  three  fre¬ 
quencies.  The  nine  cases  are  identified  in  Table 
II  below. 


I  Table  II.  Test  case  loads.  | 

Case  Name 

Freq.  (MHz) 

(n) 

3A 

3.0 

47  -  j64.7 

3B 

3.0 

150- j  160.8 

3C 

3.0 

9A 

9.0 

mirmsfiwm 

9B 

9C 

9.0 

15A 

15B 

15.0 

150  -  j32.2 

15C 

15.0 

470  -  j 272.1 

For  convenience,  a  definition  of  error  based  on 
the  geometrical  relationships  of  Fig.  3  is  taken 
here.  With  regard  to  Fig.  3,  let  M  and  P  denote, 
respectively,  the  lengths  of  the  vectors  connect¬ 
ing  the  origin  to  the  measured  and  predicted  Tj 
points.  Then,  if  E  denotes  the  length  of  the  vec¬ 
tor  connecting  the  predicted  and  measured  points 
in  the  T-pIane,  relative  error  is 

%  Error  =  X  100.  (21) 

M  ^  ^ 

lm(  Tj) 


To  emphasize  the  fact  that  rf  baluns  generally 
do  not  emulate  ideal  impedance  transformation 
behavior  well  under  mismatched  conditions,  first 
consider  the  results  for  Fj  under  the  assumption 
that  the  balun  is  an  ideal  impedance  transformer. 
These  results  are  summarized  in  Table  III. 


[  Table  III.  Ideal  impedance  transformer.  | 

Case  Name 

Ti  Error  (see  Eq.  21) 

3A 

25.4% 

3B 

108.1% 

3C 

83.5% 

9A 

51.6% 

9B 

92.5% 

9C 

33.7% 

15A 

74.9% 

15B 

99.6% 

15C 

93.8% 

AVERAGE  ERROR  =  73.7%  | 

When  the  S  matrix  procedure  described  in  this 

paper  is  follow^ed,  the  predicted  values  of  Tin  are 
much  closer  to  actual  measurement  values.  A 
summary  of  these  results  is  contained  in  Table 
IV. 


Table  IV.  Results  using  S  characterization. 

Case  Name 

Ti  Error  (see  Eq.  21) 

3A 

12.2% 

3B 

34.7% 

3C 

1.9% 

9A 

5.6% 

9B 

12.1% 

9C 

1.2% 

15A 

8.4% 

15B 

7.0% 

15C 

2.5% 

AVERAGE  ERROR  =  9.5%  | 

Apart  from  case  3B,  the  effectiveness  of  the  balun 
characterization  procedure  described  in  this  pa¬ 
per  is  apparent. 


V.  Conclusions 

The  results  presented  in  the  preceding  Exam¬ 
ple  are  typical  and  representative  of  the  capabil- 


72 


ity  of  the  balun  characterization  procedure  pre¬ 
sented  in  this  paper.  This  method  provides  a  y- 
parameter  description  of  a  given  balun,  by  means 
of  computer-based  analysis  of  measured  Sn  ‘cali¬ 
bration’  values,  which  is  of  sufficient  accuracy  for 
most  NEC  wire  antenna  modeling  requirements. 

A  limitation  on  the  accuracy  and  reliability  of 
the  method  arises  from  the  fact  that  a  minimal 
number  of  load  impedances  is  used  to  obtain  a 
balun  characterization.  Thus,  each  individual 
measurement  strongly  influences  the  model  out¬ 
come,  and  the  S  matrix  element  values  can  be 
substantially  incorrect  as  the  result  of  just  one  in¬ 
accurate  measurement  value.  Work  is  in  progress 
on  an  extension  of  this  procedure  which  utilizes 
more  than  three  loads,  so  that  the  scattering  ma¬ 
trix  values  follow  from  the  solution  of  an  overde¬ 
termined  system  of  equations  in  a  least  squares 
sense.  It  is  reasonable  to  believe  that  an  overde- 
termined  system  will  ameliorate  the  adverse  ef¬ 
fects  of  isolated  ‘calibration  data’  errors  on  the 
model,  and  so  will  bring  analysis  and  predictions 
for  arbitrary  complex  loads  into  closer  agreement 
with  actually  observed  balun  electrical  behavior. 

The  methodology  here  requires  only  the  col¬ 
lection  of  511  data,  i.e.,  one-port  measurements. 
Other  work  in  progress  concerns  the  potential  for 
improvement  in  balun  characterization  through 
the  use  of  two-port  vector  network  analyzer  mea¬ 
surements.  It  is  plausible  to  expect  that  the  si¬ 
multaneous  measurement  of  four  complex  vari¬ 
ables  (5ii,  Si2,52i,  and  ^22)  should  provide  more 
relevant  information  than  the  measurement  of 
just  one  (5ii),  which  may  be  applied  to  obtain 
a  more  reliable  and  accurate  balun  characteriza¬ 
tion. 

As  a  result  of  this  work,  baluns  can  be  included 
in  NEC  modeling  and  analysis  of  wire  antenna 
structures  without  trepidation. 
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Abstract 

Based  on  an  approximate  dyadic  diffraction  coefficient,  equivalent  currents  (ECs) 
are  derived  for  computing  the  scattering  by  a  finite  length  impedance  wedge  of  arbitrary- 
angle.  The  derived  equivalent  currents  are  implemented  in  a  standard  general  purpose 
PTD  (Physical  Theory  of  DiflFraction)  code  and  results  are  presented  demonstrating 
the  accuracy  of  the  formulation  for  a  number  of  impedance  and  (dielectrically)  coated 
structures.  These  include  typical  shapes  such  as  plates,  finite  length  cones  and  cylinders 
which  have  been  partially  or  fully  coated. 


1  Introduction 

In  a  recent  paper,  Syed  and  Volakis  [1]  presented  diffraction  coefficients  for  an  impedance 
wedge  (of  infinite  length)  illuminated  at  skew  incidence  as  shown  in  Fig.  1.  Having  such  a 
diffraction  coefficient,  it  is  now  possible  to  consider  the  scattering  by  complex  impedance 
structures  in  the  context  of  the  geometrical  or  physical  theories  of  diffraction  (GTD  or  PTD). 
Example  applications  include  the  scattering  by  non-metallic  airborne  structures  and  signal 
fading  evaluation  for  mobile  communication  or  propagation  over  mountainous  terrain.  A 
study  of  the  latter  was  performed  by  Schneider  and  Luebbers  [2]  using  heuristic  impedance 
wedge  diffraction  coefficients. 

In  this  paper  we  consider  the  application  of  the  diffraction  coefficients  developed  in  [1] 
to  scattering  by  impedance  bodies.  A  key  element  to  this  application  is  the  development 
of  equivalent  currents  (ECs)  or  incremental  length  diffraction  coefficients  (ILDCs)  for  im¬ 
pedance  wedges.  The  ECs  were  first  introduced  by  Ryan  and  Peters  [3]  to  compute  the 
scattering  by  a  finite  edge  from  a  knowledge  of  the  corresponding  infinite  edge  diffraction 
coefficients.  They  were  later  improved  by  Michael!  [4], [5]  for  angles  off  the  Keller  cone,  and 
are  basically  fictitious  currents  placed  at  the  edge  of  a  wedge  such  that  they  generate  the 
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far  zone  diffracted  field  of  a  straight  infinite  edge.  For  a  finite  length  edge,  the  currents  are 
placed  over  the  length  of  that  edge  and  are  integrated  to  yield  an  approximation  of  the  edge 
scattered  field.  In  the  context  of  PTD  [6], [7], [8]  the  role  of  the  ECs  is  played  by  the  ILDCs 
introduced  by  Mitzner  [7]  (see  also  Butorin  and  Ufimtsev  [9]).  The  ILDCs  are  applied  to 
each  element  of  the  edge  and  generate  a  diffracted  field  which  must  be  added  to  the  physical 
optics  (PO)  contribution  from  the  adjacent  surfaces  to  produce  the  total  scattered  field. 

To  compute  the  PTD  field,  the  PO  field  is  calculated  first  by  dividing  the  structure 
into,  say,  triangular  facets  or  curvilinear  patches  and  the  PO  current  is  then  integrated  over 
each  facet  or  patch.  The  diffracted  field  or  fringe  wave  is  added  by  integrating  the  ILDC 
over  the  specified  edges  of  the  structure  (see  for  example  [10]  and  [11]).  In  the  following 
we  will  first  present  explicit  forms  of  the  ILDCs  for  impedance  wedges  before  proceeding 
with  their  application  to  scattering  by  three  dimensional  impedance  structures.  Previously, 
two  approaches  have  been  used  for  determining  the  ILDC  or  EC  expressions.  The  one 
introduced  by  Michaeli  [4], [5]  and  Mitzner  [7]  for  metallic  edges  is  more  rigorous  and  has 
been  extended  by  Pelosi  et  al.  [12]  for  impedance  wedges.  The  latter  have  yet  to  be  used 
in  practice  because  of  a  lack  of  skew  incidence  impedance  wedge  diffraction  coefficients  and 
could  indeed  be  employed  in  our  analysis  in  conjunction  with  the  diffraction  coefficients  given 
in  [1].  However,  the  resulting  expressions  are  quite  involved  and  their  improved  accuracy 
does  not  appear  to  be  justified.  Instead,  we  will  make  use  of  the  simpler  ILDCs  derived 
using  the  method  discussed  by  Knott  and  Senior  [13]. 


2  ILDCs  for  a  straight  impedance  wedge 

Consider  the  plane  wave 

E’  = 

illuminating  the  wedge  shown  in  Fig.  1  where 

I  =  —X  cos  sin  —  y  sin  sin  +  z  cos  0'”*^ 


(1) 


is  the  direction  of  incidence  and  r  —  xx  yy  +  zz. 

For  an  infinite  impedance  wedge,  the  diffracted  field  can  be  written  as 

= - j=~V.E^  (2) 

where  _ 

T>  =  (3) 

is  the  dyadic  diffraction  coefficient  for  the  impedance  wedge  given  in  [1],  s  denotes  the 
distance  between  the  diffraction  and  observation  points;  s  —  0  x  ^  and  t  —  0'  x  cp"  with 
0  =  0'  for  the  infinite  wedge.  Note  that  all  coordinates  in  (3)  are  with  respect  to  the  wedge 
(local)  coordinate  system  illustrated  in  Fig.  1.  In  contrast  to  the  PEC  wedge,  the  cross¬ 
polarization  elements  of  the  dyadic  diffraction  coefficient  matrix  are  non-zero  and  this  is  the 
reason  for  much  of  the  complication  associated  with  impedance  structures.  As  expected,  the 
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dyadic  diffraction  coefficients  are  functions  of  the  local  wedge  coordinates  (j>,  and  as 
well  as  the  upper(+)  and  lower(-)  wedge  impedances  77+  and  77_  respectively. 

To  compute  the  diffracted  field  from  a  finite  impedance  wedge  in  the  context  of  the 
method  of  equivalent  currents  we  introduce  the  equivalent  electric  and  magnetic  currents  P 
and  respectively,  such  that 

=  jko -  /  \Zo  r(r')i  X  (J  X  t)  +  /"'(r')  s  x  U  dz'  (4) 

47rr  j-t.12  ^  ^ 

recovers  the  diffracted  field  (2)  when  /  — >•  00.  As  illustrated  in  Fig.  1,  s  denotes  the  direction 
of  diffraction  given  by 


J  =  rr  sin  6^  cos  (^  +  y  sin  0^  smc^  +  z  cos  0^ 


i  is  the  vector  tangent  to  the  edge  and  the  spherical  angles  {9^,<P)  refer  to  the  global 
(unprimed)  coordinate  system  defined  in  Fig.  2. 

It  is  easy  to  show  via  the  stationary  phase  method  that  (4)  reduces  to  (2)  for  I  — >■  00 
only  when 


/">(.-)  =  +  (5) 

These  fictitious  equivalent  currents  can  in  turn  be  used  in  (4)  to  obtain  the  scattered  field 
of  a  finite  length  edge.  For  a  straight  edge,  the  diffraction  coefficients  are  independent  of  z 
and  on  this  basis  (4)  and  (5)  simplify  to 

p-jkor  _  i.i/2  ,  ,, 

u.E^  =  - {V-e') 

r  J-£/2 

=  - (!P.e‘)£sinc  ||A:o^(s  —  ?).f|  (6) 

where  u  denotes  an  arbitary  unit  vector  such  that  s.u  =  0  as  required  for  far  zone  computa¬ 
tions.  Also, 

+  u.{s  X  i)  -  {i  x  (7) 

and  we  have  made  use  of  the  incident  field  components 


4(r') 


p-jkoi.r' 

sin/?i 


{i  X  ?).e‘ 

sin  /3‘ 


g-jfcot-.r' 


The  tilde  in  (7)  was  introduced  in  preparation  for  the  definition  of  the  ILDC.  If  we  set 

Duv  =  Duv  (8) 


78 


where  {Duv}  —  D  is  the  dyadic  diffraction  coefficient  defined  in  (3),  then  (2)  gives  the 
diffracted  field  in  the  context  of  the  method  of  equivalent  currents.  On  the  other  hand  if  we 
set  _ 

=  D,,  -  Dlt  (9) 

where  is  the  physical  optics  diffraction  coefficient,  (2)  gives  t^  fringe  wave  field  in  the 
context  of  PTD  [6], [7]  and  V.e'  is  the  ILDC.  The  dyadic  coefficient  V  then  serves  to  remove 
the  contribution  of  the  physical  optics  current  at  the  edge  since  the  latter  is  already  included 
in  the  PO  field.  The  PO  diffraction  coefficient  for  an  impedance  wedge  can  be  derived  by 
following  the  procedure  discussed  in  [4]  and  will  be  presented  at  the  conference.  Also,  due 
to  space  limitations  we  skip  the  presentation  of  the  ILDCs  for  curved  edges.  They  can  be 
derived  by  tessellating  the  curved  edge  into  small  straight  edge  segments  and  applying  (8) 
to  each  segment.  The  total  scattered  field  from  the  curved  edge  is  subsequently  obtained 
by  summing  all  small  edge  contributions  comprising  the  entire  curved  edge.  This  process, 
although  simple  in  concept,  can  involve  several  trigonometric  factors  needed  to  transform  all 
vector  contribitions  to  a  common  coordinate  system. 


3  Applications 

The  derived  equivalent  currents  have  been  implemented  in  a  standard  PTD  code  [14]  which 
includes  only  first  order  high  frequency  fields  for  far  zone  computations.  Three  geometries 
(cone,  finite  cylinder  and  plate)  were  considered  for  validating  the  presented  PTD  formula¬ 
tion  and  diffraction  coefficients  for  impedance  structures,  and  in  all  cases  the  coatings  were 
simulated  by  choosing  r)±  =  j  tan(A;o^/e^"/7rf)  ■  Fig.  3  shows  the  backscatter  RCS  compu¬ 
tations  for  a  cone  geometry  whose  base  is  3A  in  diameter.  We  observe  that  the  agreement 
between  the  moment  method  and  PTD  is  indeed  very  good  in  the  main  lobes  but  deteriorates 
slightly  in  the  sidelobe  regions.  However,  since  the  disagreements  in  the  side  lobe  regions 
persisted  for  the  coated  and  uncoated  cone  (results  not  shown),  it  is  primarily  attributed  to 
higher  order  interactions  and  not  due  to  the  impedance  simulation  of  the  coating.  Results  for 
a  partially  coated  cone  are  shown  in  Fig.  4  for  the  more  challenging  VV  polarization  with  the 
cone’s  base  left  uncoated  (i.e  perfectly  conducting).  As  expected,  because  the  uncoated  base 
supports  multiple  wave  interactions,  the  PTD  pattern  for  this  polarization  is  less  accurate 
(than  the  horizontal  polarization)  near  tip-on  incidences.  The  horizontal  polarization  results 
(not  shown)  were  in  much  better  agreement  with  the  moment  method  data  in  the  same  re¬ 
gion.  However,  when  the  base  is  illuminated,  the  PTD  patterns  for  both  polarizations  are  in 
excellent  agreement  with  the  moment  method  data.  RCS  calculations  for  other  geometries 
will  be  presented  at  the  conference. 
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Figure  2;  Curved  edge  illuminated  by  a  plane  wave 


Incident  angle  0  (deg) 


Figure  3:  RCS  of  a  flat  based  cone  due  to  incident  (HH)  polarizaiton  with 

a  =  15',D  =  3A,0'  =90°;  Coating  parameters:  =  1  -  =  2-/ 

thickness=  0. 04  A. 


Figure  4:  E^i  (W)  polarization  RCS  of  a  partially  coated  flat  base  cone  having 

perfectly  conducting  base  with  <X  =  15°,  D  =  3A,  £^.  =  1  —  j\.  5,fl^  =2  —  j] 
coating  thickness=0.04A,  0'  =90°. 
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Abstract.  A  model  wdiich  combines  the  Geometrical  Theory  of  Diffraction  (GTD)  with  an  efficient  ray 
tracing  method  based  on  the  Shooting  and  Bouncing  Ray  method  (SBR)  is  presented.  The  model 
includes  ray  paths  with  up  to  two  diffractions  and  an  arbitrary  number  of  reflections  in  making 
predictions.  Path  loss  predictions  at  900  and  1900  MHz  are  compared  to  measurements  made  within 
an  approximately  500  by  500  meter  area  of  Rosslyn,  Virginia.  Measurements  were  made  using  street 
level  and  rooftop  transmitters  with  street  level  receivers.  For  the  street  level  transmitters  the  predictions 
were  made  using  a  two-dimensional  version  of  the  model  which  assumes  the  buildings  are  vertically 
uniform  and  neglects  diffraction  over  rooftops  and  terrain  effects.  The  predictions  for  rooftop 
transmitters  were  made  using  a  fully  three-dimensional  version. 

Introduction 


With  high  frequency  portable  communication  systems  becoming  more  common,  the  need  for 
fast  and  accurate  predictions  of  high  frequency  radio  propagation  in  urban  environments  has  rapidly 
increased.  Until  recently  such  predictions  were  usually  based  on  empirical  methods,  especially  if  a 
relatively  large  number  of  predictions  were  required.  But  advances  in  both  computational  methods  and 
computer  speeds  now  allow  for  fast,  site-specific  predictions  of  radio  propagation  even  in  a  complex 
urban  area.  In  this  paper  we  are  primarily  concerned  with  narrow  band  signal  attenuation,  but  the 
methods  to  be  discussed  are  also  applicable  to  signal  delay  and  delay  spread  predictions. 

When  applied  to  urban  areas,  a  propagation  model  which  only  includes  line-of-sight  and 
multiply  reflected  rays  can  not  yield  accurate  predictions  when  the  receiver  location  is  in  the  shadow 
of  a  large  building.  For  these  cases  accurate  predictions  require  determining  the  fields  scattered  by  the 
building  into  the  shadow  region.  The  Geometrical  Theory  of  Diffraction  (GTD)  has  been  applied  for 
over  a  decade  in  making  accurate  site-specific  propagation  prediction  over  hilly  terrain  [1]  and  has 
more  recently  been  applied  to  making  predictions  in  urban  areas  [2,3].  In  these  papers  the  ray  paths 
to  a  field  point  have  been  found  using  the  generalized  Fermat's  principle  and  the  method  of  multiple 
images.  For  the  urban  propagation  situation  considered  here  with  hundreds  or  perhaps  thousands  of 
building  faces  and  a  large  number  of  receiver  locations,  the  method  of  multiple  images  could  require 
large  amounts  of  computer  time. 

To  avoid  this  a  hybrid  model  which  combines  GTD  with  the  Shooting  and  Bouncing  Ray 
(SBR)  method  has  been  developed.  A  two-dimensional  (2D)  "canyon"  version  of  the  model  is  used 
when  transmitter  and  receiver  antennas  are  at  heights  below  the  building  heights  and  the  terrain  is 
relatively  flat,  otherwise  a  folly  three-dimensional  (3D)  version  which  includes  rooftop  diffraction  and 
terrain  effects  is  used. 
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The  SBR  method  was  developed  by  Professor  Lee  of  the  University  of  Illinois  and  co  workers 
[4],  Prior  to  this  development  the  GTD  was  the  preferred  ray  method  for  calculating  high  frequency 
radar  scattering  from  aircraft  and  other  complicated  structures.  But  as  the  amount  of  detailed  geometry 
infomation  available  from  Computer  Aided  Design  data  bases  increased,  the  algorithms  for  finding 
the  GTD  rays  were  not  efficient  enough  to  find  the  ray  trajectories.  In  the  GTD  approach,  the  rays 
traveling  from  the  transmitter  to  the  receiver  must  follow  precise  paths  determined  by  the  extended 
Fermat's  principle.  This  prescribes  that  each  ray  path  must  have  a  length  that  is  a  local  minimum, 
maximum,  or  inflection,  and  when  thousands  of  flat  reflecting  surfaces  are  involved,  finding  these 
particular  rays  takes  a  large  amount  of  computer  time.  Furthermore,  for  each  receiver  location  the 
process  must  be  repeated,  since  the  ray  paths  will  change. 

The  SBR  method  avoids  this  by  "shooting"  large  numbers  of  rays  from  the  transmitter  location. 
These  rays  are  shot  without  regard  for  the  receiver  location.  They  then  "bounce"  off  the  reflecting 
surfaces  following  the  laws  of  reflection.  Since  no  particular  ray  paths  are  being  searched  for,  the  path 
for  each  ray  is  found  extremely  quickly.  Many  more  rays  are  considered  than  with  GTD,  but  each  SBR 
ray  is  much  faster  to  compute  since  the  shooting  and  bouncing  process  is  very  straightforward  for  a 
geometry  which  consists  entirely  of  planar  surfaces. 

Using  the  SBR  method,  one  cannot  initially  evaluate  the  field  at  a  particular  point,  since  one 
cannot  be  guaranteed  that  all  of  the  important  SBR  rays  will  pass  through  that  point.  One  must  use  a 
collection  region  enclosing  the  point,  and  then  use  the  rays  that  pass  through  this  collection  region  to 
determine  the  predicted  field  strength.  In  2D  the  collection  regions  are  circles  or  rectangles  and  in  3D 
spheres  or  rectangular  boxes  are  used.  The  size  of  the  regions  should  be  chosen  to  be  much  smaller 
than  the  distance  characterizing  the  slow  fading  along  the  streets.  Results  in  this  paper  were  made 
using  collection  regions  five  meters  across. 

All  of  the  rays  passing  through  the  collection  area  are  sorted  and  combined  to  determine  the 
predicted  field  strength.  The  sorting  is  done  to  select  just  one  SBR  ray  for  each  unique  path.  For 
example,  perhaps  several  line-of-sight  rays  are  intercepted  by  the  collection  region.  The  sorting  process 
finds  all  of  these,  and  the  one  line-of  si^t  ray  passing  closest  to  the  center  of  the  collection  region  will 
be  selected.  The  sortmg  process  may  also  find  several  rays  that  were  collected  after  bouncing  from  the 
same  set  of  building  faces.  Again,  from  these  rays  the  one  closest  to  the  center  of  the  collection  area 
is  selected. 

We  assume  that  the  density  of  the  SBR  rays  has  been  adjusted  for  the  collection  area  size  so  that 
at  least  two  to  three  rays  for  each  unique  ray  path  are  intercepted.  In  order  to  set  the  number  of  rays  to 
be  shot  we  use  the  following  approximate  rule.  Let  d  be  the  longest  possible  distance  traveled  by  a  ray 
(approximately  3  times  the  distance  across  the  area  of  buildings  being  considered),  let  L  be  the  length 
of  one  side  of  the  collection  area  in  the  same  units  as  d,  and  let  n  be  the  desired  minimum  number  of 
rays  for  each  distinct  path  (usually  2  or  3)  to  be  intercepted  by  the  collection  area.  Then  an  estimate 
of  the  number  N  of  rays  to  be  shot  from  the  source  is 

L 


In  2D  the  rays  are  shot  on  a  single  horizontal  disk.  In  3D  the  rays  are  shot  on  many  cones,  and  the 
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number  of  rays  on  a  cone  can  be  reduced  as  the  cone  angle  decreases. 

In  addition  to  the  relative  speed  for  computing  individual  SBR  rays,  there  is  another  advantage. 
With  one  "shooting"  the  SBR  rays  for  all  the  receiver  locations  are  found.  That  is,  the  rays  are  "shot" 
without  regard  to  receiver  location,  and  then  the  rays  going  through  the  collection  region  surrounding 
a  particular  receiver  location  are  collected  and  combined  to  determine  the  field  strength  at  that  location. 
To  consider  another  receiver  location,  one  merely  changes  the  collection  region  and  repeats  the  process 
on  the  rays  going  through  the  new  region.  A  computer  work  station  typically  has  enough  memory  to 
store  all  the  ray  trajectories  for  each  source,  so  that  the  computer  hard  disk  need  not  be  read  during  this 
process.  Thus  once  the  rays  are  shot  and  bounced,  finding  the  signal  levels  for  a  large  number  of 
receiver  locations  takes  relatively  little  computer  time.  While  the  SBR  method  is  not  particularly 
efficient  if  the  field  amplitude  is  only  evaluated  at  a  few  locations,  it  becomes  increasingly  efficient 
as  more  field  points  are  included. 

Once  the  paths  to  each  field  point  location  are  found  from  the  SBR  process  the  amplitudes  are 
easily  evaluated.  If  we  let  point  0  denote  the  location  of  the  transmitter,  point  1  the  first  reflection 
point,  N  the  last  reflection  point,  R,j  the  reflection  coefficient  for  the  nth  reflection,  rop  the  total  distance 
traversed  by  the  ray  between  in  traveling  fi'om  the  source  to  the  field  point,  and  Eo(0,(l))  be  the 
amplitude  of  the  electric  field  1  meter  firom  a  unity  gain  transmitting  antenna  in  the  direction  in  which 
the  ray  initially  travels.  Then  the  complex  amplitude  of  the  field  for  the  qth  path  is  given  by 

E  (2) 

’  V  - 

where  in  general  the  1^  reflection  coefficients  may  be  angle  and  polarization  dependent.  For  the  3D 
case  the  electric  fields  are  2D  vectors  (one  term  for  each  polarization),  the  reflection  coefficients  are 
dyads,  and  the  polarization  vectors  must  be  converted  at  each  reflection  into  the  ray-fixed  coordinates 
suitable  for  application  of  the  reflection  coefficients. 

In  2D,  for  each  SBR  ray  that  travels  directly  to  the  receiver  location  as  described  in  equation 
(2),  another  will  reach  the  receiver  after  being  reflected  from  the  ground.  So  if  there  are  a  total  of  Q 
paths  found  by  the  SBR,  there  would  be  an  additional  Q  ground  reflected  paths.  In  order  to  calculate 
the  length  of  this  path  the  source  point  will  be  located  at  point  O',  which  is  the  image  of  point  0  in  the 
ground  surface.  Using  the  image  has  the  effect  of  removing  the  ground  from  the  calculation,  and 
extending  the  buildings  below  the  former  ground  level.  The  electric  field  amplitude  is  evaluated  using 
an  equation  similar  to  (2),  but  modified  to  include  the  ground  reflection  coefficient  R^.  The  equation 
for  the  amplitude  of  the  qth  ground  reflected  SBR  ray  E'^  which  undergoes  N  reflections  in  addition  to 
the  ground  reflection  is 

£„(e',4.')4  — -flfi  (3) 

"-I 

In  order  for  the  direct  and  ground  reflected  SBR  rays  to  combine  properly,  the  phase  information  in 
(2)  and  (3)  must  be  retained  by  combining  Eq  and  E,'  coherently.  The  uncorrelated  paths  are  then 
combined  incoherently  without  including  phase,  and  the  average  total  field  due  to  all  the  Q  paths  is 
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given  by 


(4) 


If  the  field  strengths  at  all  receiver  points  were  determined  by  line-of-sight  and  reflected  energy,  then 
the  approach  described  above  would  be  sufficient  to  determine  the  field  levels.  However,  for  many 
receiver  locations  diffracted  fields  make  an  important  contribution.  For  these  receiver  locations  the 
SBR  method  must  be  augmented  by  adding  diffraction.  The  predictions  are  made  by  combining  the 
SBR  method  with  the  Geometrical  Theory  of  Diffraction  (GTD). 


SBR/GTD  Hybrid  Method 

The  Geometrical  Theory  of  Diffraction  (GTD)  was  originated  by  Joseph  Keller  [5].  As  with 
the  SBR  method,  it  uses  rays  to  predict  electromagnetic  scattering  fi-om  conducting  objects.  But  while 
SBR  includes  line-of-sight  and  multiple  reflections,  the  GTD  also  includes  edge  diffracted  rays.  In  the 
approach  described  here,  the  SBR  is  used  to  determine  the  ray  paths,  and  GTD  to  evaluate  the 
amplitudes  of  the  diffracted  rays.  Within  the  framework  of  the  GTD,  the  Uniform  Theory  of 
Diffraction  wedge  diffraction  coefficient  developed  by  Kouyoumjian  and  Pathak  [6]  and  modified  for 
imperfect  reflection  due  to  finite  conductivity  and  local  surface  roughness  [1,7]  is  widely  used.  For 
multiple  diflB-action  the  spreading  factors  ^ven  in  [1]  are  used,  although  the  UTD  is  not  strictly  accurate 
for  multiple  diffraction. 

As  outlined  above,  the  hybrid  SBR/GTD  approach  considered  here  first  shoots  many  SBR  rays. 
Each  of  these  rays  is  then  bounced  through  the  building  geometry.  After  the  rays  are  determined  they 
are  sorted  by  building  interaction.  In  order  to  find  the  diflfi-acting  edges  (formed  by  building  comers), 
this  sorting  process  finds  adjacent  rays  that  interact  differently  with  the  building  geometry.  For 
example,  let  us  assume  that  of  two  adjacent  SBR  rays  one  is  reflected  from  a  certain  building  face  while 
the  other  is  not.  This  means  that  a  diffracting  edge  of  a  building  face  lies  between  these  two  rays.  It 
is  then  quite  simple  to  locate  a  diffraction  point  on  this  edge. 

For  a  2D  calculation  each  diffracting  building  comer  will  have  one  disk  of  diffracted  rays.  But 
for  3D,  different  rays  shot  from  the  transmitting  antenna  •will  have  diffraction  points  at  different 
locations  along  the  same  diffracting  edge.  In  GTD,  each  diffraction  point  is  then  the  source  of  a  cone 
of  diffracted  rays.  For  a  2D  calculation,  this  cone  degenerates  to  a  disk  of  rays.  Usually  when  GTD 
is  applied  only  a  singe  ray  of  the  cone  or  disk  is  used,  since  the  ray  trajectory  from  the  diffraction  point 
to  the  field  point  is  known.  However,  applying  SBR  at  each  diffraction  point  involves  "shooting"  a 
partial  cone  (a  disk  in  two  dimensions)  of  diffracted  rays  in  the  region  external  to  the  building.  These 
rays  then  bounce  through  the  building  geometry  just  as  if  they  were  "shot"  from  the  original  source 
location,  except  that  their  amplitude  and  phase  are  determined  by  the  GTD  wedge  diffraction 
coefficients.  For  2D  there  is  also  a  companion  ray  that  follows  the  same  ray  path  except  that  it  comes 
from  the  image  of  the  diffraction  point  in  the  ground  and  is  reflected  once  by  the  ground.  For  3D 
calculations  the  ground  reflected  rays  are  treated  separately,  as  for  the  non-diffracted  rays.  For  double 
diffraction  the  same  process  is  continued  to  locate  the  edges  for  the  second  diffraction  and  to  shoot  the 
resulting  cones  (2D  disks)  of  diffracted  rays. 
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Let  us  extend  the  equation  given  above  in  (2)  to  a  case  of  single  diffraction  in  2D.  Consider  that 
a  particular  building  comer  has  been  identified  as  the  source  of  a  diffracted  ray.  Further,  let  this 
building  comer  be  illuminated  by  a  ray  that  originally  came  from  the  actual  source,  and  that  has  been 
reflected  N  times,  with  the  notation  as  given  in  conjunction  with  (2)  above.  Then  assume  that  his  ray 
is  further  reflected  M  times  as  it  bounces  from  the  diffracting  building  edge  to  the  collection  area  near 
the  field  point.  Then  for  the  qth  such  ray  the  electric  field  at  the  field  point  will  be  given  by 
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(5) 


For  3D  calculations  the  electric  fields  are  vectors  and  the  reflection  and  diffraction  coefficients  are 
dyads  so  that  the  polarization  information  is  included. 

Equation  (5)  includes  three  separate  aspects  of  propagation.  First,  the  ray  propagates  from  the 
original  source  with  amplitude  £„  (adjusted  for  the  antenna  radiation  pattern  if  necessary)  and 
undergoes  N  reflections  until  it  reaches  the  diffracting  edge  with  rop  the  distance  from  the  source  to  the 
diffracting  edge.  Then  it  is  diffracted  by  the  edge  with  GTD  diffraction  coefficient  D,  and  with  its 
energy  spreading  changed  by  the  spreading  factor  given  in  the  square  root.  Then  it  propagates  from 
the  diffracting  edge  to  the  field  point  F,  undergoing  M  additional  reflections  along  the  way.  For  a  2D 
calculation  (5)  is  modified  as  in  (3)  to  give  the  amplitude  of  the  corresponding  ground  reflected  ray, 
and  the  two  amplitudes  are  combined  coherently  as  in  (4).  Equation  (5)  can  be  extended  to  provide  a 
reasonable  approximation  to  double  diffraction  by  using  the  spreading  factors  given  in  [1]. 


Results 

In  this  section  of  the  paper  the  2D  and  3D  predictions  are  compared  to  measurements  made  in 
Rosslyn,  Virginia  in  March  and  July  of  1995  by  a  group  at  AT&T  Bell  Laboratories.  The 
measurements  were  made  at  hundreds  of  locations  along  the  streets  within  an  approximately  500  by 
500  meter  area  of  Rosslyn.  All  the  predictions  were  made  totally  blind,  that  is,  the  predictions  were 
made  and  submitted  to  AT&T  before  any  path  loss  data  were  supplied.  Additional  information  about 
how  the  measurements  were  made,  and  some  other  comparisons  with  different  propagation  models,  is 
included  in  [8]. 

The  approximate  building  plan  used  for  the  predictions  is  shown  in  Figure  1.  For  the  2D 
predictions  the  buildings  do  not  vary  in  the  vertical  direction,  but  for  the  3D  calculations  any  vertical 
variation  of  the  buddings  was  included  in  the  calculations.  The  receiving  antenna  was  located  on  a  van, 
with  the  antenna  approximately  2.3  m  above  local  ground.  The  paths  driven  by  the  receiving  van  are 
shown  in  Figure  1.  Measurements  were  made  at  frequencies  of  900  and  1900  MHz. 

All  antennas  were  vertically  polarized.  For  the  3D  calculations  the  antenna  pattern  of  the 
transmitting  anterma  was  highly  directional,  and  was  included  in  the  predictions.  The  3D  receiving 
antenna  had  a  vertical  dipole  (sin  0)  pattern.  For  the  2D  calculations  the  vertical  antenna  patterns  were 
not  included,  and  the  horizontal  patterns  were  omnidirectional.  The  path  loss  Lp  defined  by 

L^.P^.P,-G,-G^-L^  (6) 
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where  P,  is  the  received  power,  P,  is  the  transmitted  power,  Gt  is  the  maximum  transmit  antenna  gain, 
Gjthe  maximum  receive  antenna  gain,  is  the  cable  loss,  all  given  in  dB.  Thus  the  path  loss  depends 
only  on  the  building  and  terrain  geometry  and  the  directional  gains  of  the  antennas. 

For  the  2D  example  the  transmitting  antenna  was  located  at  site  2B  on  Figure  1,  supported  by 
a  tower  at  a  height  of  10.0  m  above  local  ground.  For  the  3D  examples  the  transmitting  antennas  were 
located  on  buildings  at  sites  5  and  6  on  Figure  1.  These  antennas  were  located  2  m  above  the  building 
roofs  and  approximately  3  meters  from  the  roof  edge.  Diffraction  from  the  edge  of  the  roof  near  the 
transmitting  antenna  was  not  included  in  the  calculations.  The  antenna  height  at  site  5  was 
approximately  42  m  above  local  ground,  and  at  site  6  approximately  43  meters  above  local  ground. 
These  latter  two  heights  are  above  many  of  the  nearby  buildings,  so  that  propagation  over  the  buildings 
is  important. 

A  typical  2D  prediction  result  for  the  10  m  high  transmitting  antenna  at  site  2B  at  900  MHz  is 
shown  in  Figure  2  for  the  receiving  van  driven  along  the  19th  Street/Nash  Street  path.  The  first  225 
m  of  this  path  is  a  line-of-sight  region,  and  the  remainder  must  be  reached  by  multiple 
reflection/diffiaction.  A  3D  prediction  for  transmitter  site  5  at  1.9  GHz  is  compared  with  measurements 
in  Figure  3  for  the  Ft.  Myer  Dr.  van  path.  This  path  also  involves  both  line  of  sight  and 
reflection/diffraction  regions.  Finally,  a  pair  of  predictions  for  transmitter  site  6  at  900  MHz  is  shown 
in  Figures  4  and  5  for  Lynn  St.  and  Kent  St.  respectively.  Going  from  Lynn  St.  to  Kent  St.  introduces 
an  additional  block  of  buildings  between  the  transmitting  and  receiving  antennas.  The  3D  prediction 
is  quite  accurate  in  showing  the  additional  attenuation. 
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(UJ)A 

Figure  1:  Plan  view  of  Rosslyn  VA  showing  the  building  locations,  transmitter  sites,  and 
paths  driven  by  the  receiving  van 
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Figure  2:  Prediction  by  2D  SBR/GTD  model  at  900  MHz  for  transmitting  antenna  at  site 
2B  located  at  10  m  above  local  ground  for  the  19th  St./Nash  St.  receiving  van  path 


Figure  3:  Prediction  by  3D  SBR/GTD  model  at  1.9  GHz  for  transmitting  antenna  at  site 
5  located  on  a  roof  42  m  above  local  ground  for  the  Ft.  Myer  Dr.  receiving  van  path 
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Figure  4:  Prediction  by  3D  SBR/GTD  model  at  900  MHz  for  transmitting  antenna  at  site 
6  located  on  a  roof  43  m  above  local  ground  for  the  Lynn  St.  receiving  van  path 


Figure  5:  Prediction  by  3D  SBR/GTD  model  at  900  MHz  for  transmitting  antenna  at  site 
6  located  on  a  roof  43  m  above  local  ground  for  the  Kent  St.  receiving  van  path 


Analysis  of  Dielectric  Structures 
Using  the  NEC-BSC 


R.  J.  Marhefka  and  L,  W.  Henderson 
The  Ohio  State  University 
Electro  Science  Laboratory 
1320  Kinnear  Road 
Columbus,  OH  43212-1 191 
Tel;  (614)292-5752 
Fax;  (614)292-7297 
Email;  ijm@tiger.eng.ohio-state.edu 


I.  Introduction 

The  NEC  -  Basic  Scattering  Code  (NEC-BSC)  has  been  used  for  over  20  years  for  the  analysis  of  high 
frequency  antenna  performance  in  complicated  structures  such  as  ships,  aircraft,  satellites,  and 
buildings.  Because  it  is  based  on  the  Uniform  Geometrical  Theory  of  Diffraction  (UTD),  it  is  able  to 
handle  large  structures  reasonably  efficiently.  It  has  been  extensively  validated  for  perfectly  conducting 
structures  in  the  past.  It  had  a  limited  dielectric  capability,  composed  primarily  of  single  multilayered 
dielectric  plates.  In  order  to  model  buildings  and  radomes,  however,  it  is  important  to  be  able  to  model 
multiple  plates  of  various  compositions  Joined  together  at  arbitrary  angles.  The  user  desires  to  know 
how  the  structure  will  affect  the  side  lobe  levels  of  the  enclosed  antenna,  especially  when  the 
transmitted  energy  can  be  multiply  bounced  within  the  enclosure  before  it  exits.  In  addition,  the 
diffraction  from  the  junctions  needs  to  be  modeled  to  ensure  continuity  of  the  predicted  fields. 

A  new  version  of  the  NEC-BSC  is  being  developed  to  handle  this  situation.  Extensive  revision  to  the 
dielectric  junction  diffraction  coefficient  has  been  undertaken.  Now  arbitrary  angles  between  dielectric 
plates  from  360  degrees  down  to  60  degrees  have  been  implemented.  This  is  accomplished  using  a 
heuristically  derived  coefficient  that  contains  appropriate  physics  depending  on  the  various  reflection 
and  transmission  boundaries  encountered. 

A  number  of  examples  will  be  presented  illustrating  how  the  results  from  the  code  can  help  the  user 
determine  antenna  performance  in  various  dielectric  enclosures.  A  new  capability  of  displaying  rays  will 
be  used  to  illustrate  how  the  building  or  radome  affects  the  patterns.  The  structures  studied  contain 
many  plates  and  sources,  yet  volumetric  patterns  can  be  obtained  in  reasonable  amounts  of  time  on 
modem  PCs  or  workstations. 
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II.  Background 

The  NEC-BSC  has  had  a  limited  dielectric/magnetic  capability  since  Version  2  [1],  The  code  would 
allow  thin  slabs  or  coated  plates  to  be  modeled.  Versions  2  can  find  the  fields  reflected  and/or 
transmitted  up  to  double  interactions.  Version  3.2  [2]  added  up  to  triple  interactions.  The  geometrical 
optics  reflection  and  transmission  coefficients  were  obtained  by  a  recursive  method  [3].  This  allows  for 
multilayered  material  slabs  to  be  efficiently  handled. 

In  both  versions,  the  diffraction  coefficient  at  the  plate  junctions  is  based  on  a  paper  by  Burnside  and 
Burgener  [4].  In  their  paper,  a  heuristic  diffraction  coefficient  is  developed  for  thin  “knife-edged”  slabs. 
For  the  NEC-BSC,  which  allows  for  plates  to  be  joined  together  at  their  edges,  the  diffraction 
coefficient  was  modified  to  accommodate  wedge  angles.  The  extension,  however,  was  limited  to 
convex  wedge  angles  vdth  the  source  and  receiver  on  the  same  side  of  the  plates.  If  the  plates  are 
convex  and  metallic  backed,  the  results  are  reasonable.  If  the  geometrical  optics  fields  are  dominant 
and/or  the  continuity  of  the  fields  are  not  important,  this  approximation  is  also  reasonable.  For  the 
general  problem,  with  arbitrary  wedge  angles,  source  and  receiver  positions,  and  a  wide  range  of 
contrasting  materials,  improvements  were  needed  for  better  accuracy. 

Since  the  solution  to  the  general  material  wedge  or  thin  wedged  slabs  is  not  available,  it  is  still 
necessary  to  base  such  a  development  on  heuristic  methods.  The  Burnside  -  Burgener  approach  is, 
therefore,  again  used  to  obtain  a  more  general  diffiaction  coefficient.  A  summary  of  the  development 
can  be  found  in  Reference  [5].  The  approach  entails  extensively  bookkeeping  the  physics  for  different 
situations  and  attaching  the  resulting  reflection  -  transmission  combinations  to  the  appropriate 
components  of  the  diffraction  coefficient.  The  results  [5]  have  been  validated  for  simple  two  plate 
problems  using  the  moment  method  [6],  with  good  engineering  agreement  in  most  situations  tried.  One 
of  the  limitations  present  in  this  approximation  arises  from  the  fact  that  surface  waves  have  not  been 
included.  This  primarily  limits  the  position  of  the  antennas  to  be  away  from  the  surface  of  the  material. 
Of  course,  if  loss  is  present  in  the  material,  surface  wave  effects  will  be  minimized  in  comparison  to  the 
optics  effects. 

In  addition  to  an  improved  algorithm  for  the  edge  diffractions,  the  reflection  and  transmission  algorithm 
has  been  generalized  to  include  resistive  and  impedance  boundary  layers.  When  this  is  fully  integrated 
into  a  new  version  of  the  NEC-BSC,  a  broader  class  of  practical  problems  can  be  more  accurately 
modeled.  For  example,  antennas  in  radomes  and  antenna  siting  and  coupling  in  an  around  building 
environments  will  be  possible.  The  next  section  provides  a  hypothetical  example. 

III.  Building  Example 

One  of  the  recent  topics  of  interest  is  the  propagation  around  and  through  buildings.  The  rapid  growth 
in  personal  communications  technology  has  increased  the  desire  to  predict  the  performance  of  systems 
in  situ.  In  general,  real  environments  are  very  complex  with  wires,  pipes,  heating  ducts,  etc.,  buried  in 
the  walls  not  to  mention  furniture  in  various  configurations.  An  analysis  tool,  like  the  NEC-BSC,  can 
provide  insight  into  the  physical  mechanisms  making  up  the  link  budget  between  transmitter  and 
receiver.  Even  a  “simplified”  model  can,  therefore,  convey  substantial  information  about  the  viability  of 
a  given  situation.  For  example,  purely  statistical  models  can  be  difficult  to  validate,  however,  individual 
physical  situations  can  be  measured  and/or  more  easily  calculated  by  alternative  means  and  compared. 
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The  individual  pieces  can  then  be  combined  in  such  a  way  as  to  give  ideal,  typical,  or  worst  case 
scenarios  leading  to  reasonably  believable  bounds. 

In  order  to  better  observe  the  capabilities  of  the  NEC-BSC  to  analyze  such  situations,  a  hypothetical 
problem  of  determining  the  affect  of  a  building  on  antenna  patterns  and  coupling  between  half¬ 
wavelength  dipoles  is  discussed.  The  NEC-BSC  Workbench™  [7]  is  used  to  display  the  geometry  and 
UTD  rays.  The  building  is  a  simplified  model  of  an  office  room  (10’xl3’x9.3’)  with  a  glass  window 
(0.125”,  81=5.23,  tan5=0.013)  and  a  open  door.  There  are  two  adjoining  areas  and  a  6’  hall  way.  The 
situation  is  depicted  in  Figure  1 .  The  building  sits  on  a  infinite  ground  plane  with  a  dielectric  constant 
of  dry  soil  (er=2.38,  tan5=0.048).  The  roof  of  the  room  is  assumed  to  be  a  layer  of  0.5”  wood  with 
appropriate  dielectric  constant  (er=3.6).  The  walls  of  the  building  are  six  inch  concrete  blocks.  The 
model  for  the  blocks  is  taken  to  be  a  triple  layer  slab.  The  outer  layers  have  the  thickness  (0.53”)  and 
dielectric  constant  of  the  concrete  (sr=3.0,  tan6=0.048).  The  inner  layer  (5. 1 1”)  is  an  average  dielectric 
constant  between  air  and  the  concrete  (81=1.5).  A  paper  by  Honcharenko  and  Bertoni  [8]  compare  the 
accuracy  of  such  a  model  versus  a  more  complex  periodic  model  they  developed  for  the  inner  layer. 
The  simplified  model  gives  good  enough  results  for  this  application. 

The  complexity  of  the  number  of  UTD  interactions  that  can  be  determined  is  illustrated  by  looking  at  a 
visualization  of  rays.  The  NEC-BSC  Workbench  in  combination  with  ray  files  produced  by  a  new 
version  of  the  NEC-BSC  can  be  used  to  depicted  the  rays.  Various  numbers  of  bounces  and/or  field 
strength  levels  can  be  shown  by  using  the  filtering  option.  A  half  wavelength  dipole  is  centered  in  the 
office  room.  A  receiver  is  situation  in  the  middle  of  the  hallway.  Figure  2  shows  the  dominant  rays 
over  a  30  dB  dynamic  range  from  the  source  to  the  receive  via  various  wall  reflections,  transmissions, 
and  diffractions.  Notice  that  if  the  filter  is  open  to  a  60  dB  dynamic  range  the  number  of  rays  greatly 
increases  as  shown  in  Figure  3 .  Notice  that  even  thought  the  situation  depicted  is  symmetric  in  terms  of 
relative  source  and  receiver  position  in  terms  of  the  walls,  the  ray  picture  is  not  symmetric.  This  is  due 
to  the  fact  that  the  reflection  coefficient  of  the  walls  is  larger  than  for  the  glass. 

The  next  example  illustrates  the  coupling  between  half  wavelength  dipoles.  Two  cases  at  different 
frequencies  of  0.9  GHz  and  3.0  GHz  are  compared.  The  coupling  between  a  dipole  in  the  center  of  the 
office  to  the  second  dipole  as  it  proceeds  down  the  center  of  the  hall  is  shown  in  Figure  4.  The  dipoles 
are  4’  off  the  floor.  The  coupling  between  the  dipole  in  the  office  to  a  second  dipole  as  it  proceeds  in 
the  same  fashion  outside  the  building  is  shown  in  Figure  5. 

The  last  example  illustrates  the  affect  the  building  has  on  the  far  zone  pattern.  Figure  6  shows  the 
elevation  pattern  at  phi=90  degrees  of  the  dipole  in  the  center  of  the  office  at  3.0  GHz  versus  the  dipole 
over  the  ground  plane  alone.  Figure  7  shows  the  same  situation  but  in  a  conic  cut  at  theta=86  degrees 
comparing  0.9  GHz  case  to  3.0  GHz  case.  Note  that  the  no  building  case  is  a  constant  for  the  conic 
cut.  The  affect  of  the  building  is  obvious  in  the  two  cases. 

IV.  Summary 

The  material  plate  model  for  the  NEC-BSC  is  being  generalized  to  accommodate  a  wider  class  of 
situations.  With  these  improvements  the  code  will  be  able  to  more  accurately  model  Joined  plates  with 
arbitrary  wedge  angles,  source  and  receiver  positions,  and  dielectric/magnetic  constituencies.  The 
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algorithms  have  been  validated  against  simple  situations  using  moment  methods.  The  determination  of 
the  accuracy  for  more  general  problems,  such  as  antennas  in  the  present  of  buildings,  etc.,  awaits 
further  validation. 
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Power  out/in  (dB) 


Figure  3.  Coupling  between  dipoles  showing  rays  over  60  dB  dynamic  range. 


Figure  4.  Coupling  between  dipole  in  room  and  dipole  in  center  of  hallway. 
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Theta 

Figure  6.  Pattern  of  dipole  with  Phi  =  90  degrees  at  3.  0  GHz. 
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Abstract 

The  present  contribution  deals  with  a  hybrid  method  combining  the  method  of  moments  (MM) 
with  the  physical  optics  (PO)  approximation  and  asymptotic  currents  based  on  Fock  theory.  This 
current  based  hybrid  approach  is  employed  to  deal  with  arbitrarily  shaped  monopole  antennas 
located  on  perfectly  conducting,  curved  convex  surfaces.  Some  examples,  e.g.  an  inverted-L 
antenna  mounted  on  an  open  truncated  cone,  demonstrate  that  by  the  hybrid  approach  a  drastic 
reduction  of  memory  requirement  and  CPU-time  can  be  achieved  as  compared  to  an  application 
of  the  MM  alone. 


1  Introduction 

Even  though  fast  computers  with  large  memories  are  available  to  antenna  design  engineers  nowa¬ 
days  it  remains  a  challenge  to  analyze  a  certain  class  of  electromagnetic  scattering  problems. 
These  are  bodies  with  geometrical  dimensions  large  as  compared  to  the  wavelength  but  requiring, 
for  instance,  an  accurate  analysis  of  a  radiating  antenna  so  that  an  application  of  pure  asymptotic 
techniques  such  cis  the  uniform  geometrical  theory  of  diffraction  (UTD)  is  out  of  question.  The 
MM  can  also  not  be  considered  because  memory  requirement  and  CPU  time  grow  proportional 
to  and  /®,  respectively,  with  /  being  the  frequency. 

A  remedy  is  found  in  hybrid  methods  combining  the  MM  with  asymptotic  techniques  [1,  2,  3]. 
In  principle  one  has  to  distinguish  between  ray  based  asymptotic  techniques  such  as  UTD  and 
current  based  techniques  like  PO  or  the  physical  theory  of  diffraction  (PTD)  [4].  Depending  on 
the  structure  to  be  analyzed  both  of  them  offer  some  advantages.  Hybrid  MM/UTD  formulations 
(e.g.  [5,  6,  7])  are  suitable  for  problems  when  e.g.  an  antenna  is  located  in  front  of  a  large  scattering 
body.  On  the  other  hand,  current  based  techniques  should  be  prefered  when  parts  of  one  large 
scatterer  are  to  be  treated  with  MM  and  the  remaining  parts  with  asymptotic  techniques.  In 
this  case  a  uniform  current  based  formulation  can  be  applied  ensuring  current  continuity  at  the 
boundary  between  the  MM-  and  the  asymptotic  region. 
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Current  based  hybrid  methods  are  summarized  in  [8].  An  application  to  three-dimensional  bodies 
can  be  found  in  [9, 10, 11],  where  PO  is  employed  in  the  asymptotic  region.  In  [11, 12]  the  authors 
proposed  corrections  of  the  PO  current  to  improve  the  accuracy  by  accounting  for  effects  of  edges 
in  the  asymptotic  region.  The  present  paper  aims  at  demonstrating  a  similar  improvement  by 
replacing  PO  by  Fock  currents  [13]  for  curved  convex  surfaces.  As  opposed  to  [14]  our  approach  is 
not  restricted  to  bodies  of  revolution  but  arbitrarily  shaped  three-dimensional  perfectly  conducting 
bodies  can  be  investigated. 


2  Hybrid  method 


Like  the  MM,  the  hybrid  method  represents  a  current  based  formulation  where  first  an  electric 
surface  current  density  J  is  determined  on  the  metallic  surfaces  including  surfaces  of  cylindrical 
wires.  Then  the  electric  and  magnetic  fields  radiated  by  J  can  be  computed  at  an  observation 
point  r  from  the  relations 

£(?)  =  £{/}  jf  J(f')-G(?,f')dA'  (1) 

TTcW 

ff(?)  =  =  /(?') .  G{r,  f’]dA'.  (2) 

A' 


with  the  free  space  Green’s  function 


G(r,  fO  = 


If*  —  r 'I 


(3) 


Here  r  denotes  the  observation  point,  r'  refers  to  the  source  point,  and  j3  =  ^  is  the  wavenumber. 

Fig.  1  shows  a  monopole  antenna  mounted  on  a  curved  convex  surface  element.  In  general,  the 
antenna  as  well  as  a  small  region  of  the  surface  around  the  antenna  (gray  shaded  in  Fig.  1)  are 
modeled  by  MM  whereas  the  current  on  the  remaining  part  of  the  surface  is  based  on  asymptotic 
expansions. 

In  the  MM-region  the  current  is  expressed  as  linear  superposition  of  basis  functions 


Nf  Ns  Nh 

jMM  =  J]  an  ■  /n  +  I]  /?n  •  S  7n  •  (4) 

n=l  Ti=l  71=1 


with  =  N f  +  Ng  Nh  denoting  the  total  number  of  basis  functions  in  the  MM-region. 

Metallic  surfaces  are  subdivided  into  electrically  small  triangular  patches.  On  these  patches  the 
rooftop  basis  functions  /„  according  to  Rao  et  al.  [15]  are  used.  Wire  antennas,  e.g.  the  monopole 
antenna  depicted  in  Fig.  1,  are  subdivided  into  small  wire  segments  where  we  employ  triangular 
basis  functions  gn-  Furthermore  special  basis  functions  are  required  to  model  the  transition 
from  segments  to  patches. 
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Fig,  1:  General  structure  of  a  monopole  antenna  mounted  on  a  curved  convex  surface  element. 

In  the  asymptotic  region  in  Fig.  1  the  current  is  expressed  as  linear  superposition  of  rooftop 
basis  functions  on  triangular  patches  as  well: 


jasym 


Qn' In 

n=Nf+l 


(5) 


This  representation  as  superposition  of  basis  functions  offers  advantages  concerning  the  continuity 
of  the  surface  current  density  across  the  boundary  between  the  MM-  and  the  asymptotic  region 
[11], 

Only  the  coefficients  an,  and  7„  in  eqn.  (4)  are  obtained  by  solving  a  system  of  linear 

equations  resulting  from  an  electric  field  integral  equation.  The  remaining  coefficients 

in  eqn.  (5),  however,  are  directly  based  on  an  asymptotic  current  representation  (PO  or  Fock 
currents),  i.e.  the  time  and  memory  consuming  process  of  solving  a  system  of  linear  equations  is 
avoided. 

The  details  of  the  hybrid  method  can  be  found  in  [11].  Here  we  will  concentrate  on  some  funda¬ 
mental  equations  only. 

From  the  boundary  condition 

Etan  =  0  (6) 

valid  on  the  whole  surface  of  the  perfectly  conducting  scattering  body  the  electric  field  integral 
equation 

glfMM-i  (7) 

1.  J  tan  <•  J  ton 

can  be  derived.  Here  Ei^tan  represents  the  tangential  component  of  the  impressed  electric  field 
strength  of  the  excitation.  When  dealing  with  monopole  antennas  Ei  might  be  caused  by  a 
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magnetic  frill  current.  The  operator  S  in  eqn.  (7)  acting  on  J  yields  the  scattered  electric  field 
according  to  eqn.  (1). 

In  the  following  the  asymptotic  current  can  always  be  expressed  as  ^function  of  the  incident 

magnetic  field  strength  H,nc-  As  an  example  consider  the  PO  current  —  2h  x  Hinc  ia  the 
illuminated  region.  The  more  general  case  with  Fock  currents  will  be  described  in  Section  3. 
Therefore  we  can  introduce  an  operator  K  so  that  the  current  in  the  asymptotic  region  is  expressed 
as 

ja,ym  ^  ^  ^  ^  ^  .  (8) 

The  incident  magnetic  field  Hinc  is  the  superposition_^of  the  impressed  magnetic  field  Hi  of  the 
excitation  and  the  magnetic  field  radiated  by  currents  J  a/m  MM-region.  It  is  very  important 
to  take  this  radiated  field  into  account  as  for  example  looking  at  the  scatterer  in  Fig.  1  the 
impressed  magnetic  field  Hi  of  the  excitation  is  mainly  confined  to  the  feed  zone  of  the  monopole 
antenna  so  that  the  current  in  the  asymptotic  region  is  caused  by  the  magnetic  field  radiated  by 
the  currents  JA/m  ^long  the  antenna. 

By  inserting  eqn.  (8)  in  (7)  we  arrive  at 

{S  +  gKH)  -  £ K  .  (9) 

With  eqn.  (4)  and  by  exploiting  linearity  of  operators  S,  H,  and  K  we  finally  obtain 

Zo,-[{e+eKH)  {/„}J 

n=l 

n=l 

Nh 

+  E 7.  ■  [(f  +  -  eK  .  (10) 

n=l 

By  applying  the  scalar  product  with  suitable  weighting  functions  (e.g.  Galerkin  method)  the 
integral  equation  (10)  can  be  converted  into  a  system  of  linear  equations  for  the  =  Nf  + 

Hg  +  Nh  unknown  coefficients  in  the  MM-region.  After  solving  this  system  of  linear  equations  the 
current  /a/m  jg  ]jnown.  Then  from  eqn.  (8)  the  current  in  the  asymptotic  region  is  obtained 

in  a  straightforward  manner.  Once  this  current  is  known  the  scattered  electric  and  magnetic  fields 
can  be  computed  with  the  aid  of  eqns.  (1)  and  (2). 


3  Asymptotic  currents 


The  simplest  possible  expression  for  the  asymptotic  current  has  already  been  briefly  men¬ 

tioned  in  Section  2.  It  is  the  physical  optics  approximation 


JPO  _  I  2  X  Hinc 

1  0 


illuminated  region 
shadowed  region 


(11) 
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with  the  normal  vector  n  on  the  surface  pointing  to  the  source.  In  this  case  the  operator  K  in 
eqn.  (8)  takes  the  form 

K  {5,~}  =  2ShX  Sine  (12) 

with  S  being  zero  or  one  depending  on  the  location  of  the  observation  point  in  the  illuminated  or 
shadowed  region,  respectively. 

The  PO  current  is  not  very  suitable  for  the  analysis  of  monopole  antennas  on  curved  convex  bodies 
as  shown  in  Fig.  1  for  two  reasons: 

•  There  is  a  discontinuity  of  at  the  shadow  boundary^. 

•  In  the  whole  shadowed  region  is  set  to  zero. 

We  employ  Fock  currents  [13]  in  order  to  overcome  these  difficulties.  They  are  continuous  across 
the  shadow  boundary  and  account  for  the  creeping  waves  in  the  shadowed  region. 

Expressions  similar  to  those  in  Refs.  [1,  3]  are  used  for  the  Fock  currents  in  the  illuminated  and  in 
the  shadowed  regioi^  but  we  have  eliminated  the  dependence  on  the  incident  electric  field  Einc  so 
that  a  formulation  =  K  according  to  eqn.  (8)  becomes  possible.  Detailed  expressions 

for  can  be  found  in  [16]. 


4  Examples 


Fig.  2:  ^-monopole  antenna  mounted  on  a  thick  rectangular  body  with  rounded  edges.  The 
dimensions  of  the  body  are  2A  x  2A  x  |. 

The  first  example  is  shown  in  Fig.  2  where  a  ^-monopole  antenna  is  mounted  on  a  thick  rectangular 
body  with  rounded  edges.  The  surface  of  the  scattering  body  has  been  divided  into  three  regions. 
The  MM-region  consists  of  the  monopole  antenna  and  the  small  gray  shaded  square  area  around 
the  antenna.  The  remaining  flat  part  of  the  top  surface  is  assigned  to  the  PO-region,  while  the 

^Please  note  that  the  location  of  the  shadow  boundary  in  Fig.  1  depends  on  the  source  point,  i.e.  on  the  location 
of  the  basis  function  under  consideration  in  eqn.  (10). 
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Fig.  3:  Vertical  radiation  pattern  of 
the  monopole  antenna  de¬ 
picted  in  Fig.  2. 


-  moment  method 

- hybrid  method 


cylindrical  and  spherical  elements  represent  the  Fock-region.  The  subdivision  of  all  surfaces  — 
independent  from  which  region  th^  are  assigned  to  —  into  triangular  patches,  which  is  necessary 
in  order  to  apply  basis  functions  /„  in  eqns.  (4)  and  (5),  is  also  displayed  in  Fig.  2. 

The  vertical  radiation  pattern  of  the  structure  in  Fig.  2  is  shown  in  Fig.  3.  The  solid  line  is  a 
reference  solution  based  on  an  application  of  the  MM  to  the  entire  structure,  while  the  dashed 
curve  represents  the  result  of  the  hybrid  method  combining  MM,  PO  and  Fock  theory.  The  results 
are  in  very  good  agreement,  while  the  memory  requirement  for  the  matrix  of  the  system  of  linear 
equations  has  been  reduced  by  a  factor  of  about  10. 


Fig.  4:  Inverted-L  antenna  mounted  on  an  open  truncated  cone. 

Another  example  taken  from  [9]  is  shown  in  Fig.  4.  An  inverted-L  antenna  is  located  on  an  open 
truncated  cone.  Here  two  different  hybrid  formulations  are  compared:  The  gray  shaded  area  in 
Fig.  4  is  treated  with  MM  while  either  PO  or  Fock  currents  are  employed  on  the  remaining  conical 
surface. 
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Fig.  5:  Vertical  radiation  pattern  of 
the  inverted-L  antenna  on  a 
cone  depicted  in  Fig.  4  (verti¬ 
cal  polarization).  Frequency 
/  =  1  GHz  and  dimensions 
ri  =  0.18  m 
r2  =  0.16  m 
hi  =  0.12m 
h2  =  0.36  m 
h  =  0.025  m 
b  =  0.050  m 


- moment  method 

.  hybrid  method  (PO) 

- hybrid  method  (Fock) 


The  vertical  radiation  pattern  in  a  plane  perpendicular  to  the  axis  of  the  cone  is  depicted  in  Fig. 
5.  The  solid  line  is  again  a  reference  solution  obtained  by  applying  MM  to  the  entire  structure. 
The  hybrid  methods  with  PO  and  Fock  currents  are  represented  by  the  dotted  and  dashed  curves, 
respectively.  It  is  obvious  that  the  hybrid  method  with  Fock  currents  is  in  much  better  agreement 
to  the  reference  solution  (solid  line)  than  the  hybrid  method  with  PO.  The  main  reason  for  this 
is  that  Fock  theory  also  accounts  for  the  currents  in  the  shadowed  region. 


5  Conclusions 

A  current  based  hybrid  method  has  been  presented  that  combines  MM  with  asymptotic  expansions 
such  as  PO  or  Fock  currents.  Even  though  the  approach  is  a  very  flexible  one  concerning  shape  and 
size  of  the  scatterer,  we  have  restricted  the  scope  of  application  in  this  paper  to  monopole  antennas 
mounted  on  curved  convex  bodies.  This  particular  geometry  requires  special  basis  functions  to 
model  a  continuous  current  flow  from  the  surface  of  the  curved  convex  body  to  the  wire  antenna. 
It  also  turned  out  to  be  advantageous  to  determine  not  only  the  currents  along  the  antenna  but 
also  the  currents  on  a  small  area  around  the  antenna  by  MM.  The  asymptotic  expansion  (PO  or 
Fock  theory)  has  been  employed  on  the  remaining  parts  of  the  surface. 

Two  examples  have  been  demonstrated  where  the  results  of  the  hybrid  method  compare  well  to 
those  of  the  conventional  MM. 
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Abstract 

The  analysis  for  the  electromagnetic  radiation  or  scattering  from  a  material  cylinder  in 
the  presence  of  a  wedge  with  impedance  faces  is  presented.  The  problem  considered  is  two 
dimensional  where  the  incident  field  is  polarized  TM  or  TE  to  the  axis  of  the  wedge,  and 
the  material  body  may  be  of  arbitrary  shape  and  inhomogeneous.  An  integral  equation 
is  formulated  where  the  field  inside  the  material  cylinder  is  the  unknown  quantity  to  be 
determined  and  the  Green’s  function  for  the  impedance  wedge  is  included  in  the  kernel  of  the 
integral  equation  which  accounts  for  the  presence  of  the  wedge.  The  integral  equation  is  then 
solved  using  a  method  of  moments  technique,  which  allows  us  to  obtain  modified  numerical 
diffraction  coefficients  to  represent  the  effect  of  the  body.  The  primary  appUcation  for  this 
analysis  is  to  study  methods  for  reducing  the  EM  scattering  from  wedge  shaped  objects; 
specifically  the  fields  which  diffract  from  the  sharp  corner  formed  by  the  edge  of  the  wedge. 
Attaching  a  lossy  material  body  to  the  tip  of  the  wedge  is  an  effective  method  for  reducing 
these  diffracted  fields. 


1.  Introduction 

The  diffraction  of  an  electromagnetic  wave  by  a  wedge  is  a  fundamental  problem  that  has  been 
extensively  studied.  The  very  general  configuration  of  a  wedge,  whose  faces  can  have  different 
impedance  boundary  conditions,  was  chosen  for  its  versatility  in  representing  more  specialized 
geometries;  i.e.,  conducting  half-planes,  right  angle  corners,  wedges  with  coated  surfaces,  etc. 
Studying  the  fields  that  diffract  from  the  edge  of  the  wedge  can  provide  valuable  insights  into  the 


^This  work  was  supported  in  part  by  the  Joint  Services  Electronics  Program  (Contract  N00014-89-J-1007)  and 
by  the  Ohio  State  University  Research  Foundation. 
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role  that  sharp  edges  have  on  the  scattering  characteristics  of  structures.  Obtaining  the  diffraction 
coefficient  for  the  wedge  can  therefore  provide  a  valuable  tool  for  incorporating  the  effect  of  edge 
diffraction  when  determining  the  scattering  from  certain  objects.  There  have  been  numerous 
asymptotic  solutions  for  the  diffraction  coefficient  for  an  impedance  wedge  [1,  2,  3];  most  of  which 
are  closed  form  solutions  that  allow  for  convenient  numerical  implementation.  However,  this  paper 
will  focus  on  the  more  complex  configuration  of  a  material  body  attached  to  the  vertex  of  a  wedge; 
an  example  of  which  is  shown  in  Figure  1.  The  problem  is  a  two-dimensional  one  where  the  wedge 
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Figure  1:  Configuration  for  a  material  body  in  the  presence  of  an  impedance  wedge. 

faces  have  different  impedance  values  given  by  Zi  and  Z2.  The  material  body  can  be  of  arbitrary 
shape  and  its  material  parameters,  e  and  fi,  may  be  inhomogeneous.  The  problem  will  then  be  to 
determine  how  incorporating  this  material  body  modifies  the  diffraction  coefficient  of  the  wedge. 
Solutions  for  these  modified  diffraction  coefficients  are,  in  general,  too  complex  to  obtain  in  closed 
form  and  therefore  will  be  obtained  numerically.  This  will  involve  formulating  an  integral  equation 
to  determine  the  scattering  from  the  material  body  in  the  presence  of  the  wedge  and  then  using 
a  numerical  technique  to  solve  it.  By  allowing  these  material  bodies  to  be  lossy,  this  analysis 
can  have  practical  applications  for  studying  methods  for  reducing  the  electromagnetic  fields  that 
diffract  from  sharp  edges. 

The  analysis  proposed  here  will  use  the  MM/Green’s  function  approach  where  the  Green’s 
function  that  is  employed  in  the  kernel  of  the  integral  equation  is  the  specialized  one  for  the 


110 


impedance  wedge.  Using  the  specialized  Green’s  function,  as  opposed  to  the  free-space  one, 
reduces  the  number  of  unknowns  since  the  boundary  conditions  along  the  wedge  faces  do  not 
have  to  be  enforced.  The  problem  is  a  two-dimensional  one  where  its  edge  is  located  along  the 
z-axis,  and  the  structure  is  infinite  in  the  z  direction.  For  the  case  when  the  fields  have  no  z- 
dependance,  it  can  be  shown  that  the  problem  can  be  reduced  to  two  scalar  problems:  namely, 
^  Ojffi  =  0  and  =  0,^2  ^  0.  Thus,  the  scalar  function  V  wiU  be  defined  as 

U  =  Eg  for  TM^,  and  U  —  Hg  for  TE^.  The  wedge  faces  have  surface  impedance  values  given 
by  Zi  and  Z2,  and  titt  is  the  exterior  wedge  angle  {n  need  not  be  an  integer).  The  faces  of  the 
wedge  satisfy  the  impedance  or  Leontovich  boundary  condition.  The  expressions  for  this  Green’s 
function  are  extremely  complicated  and  will  therefore  be  omitted  in  this  paper;  however,  the 
reader  can  refer  to  [4]  for  a  thorough  development  of  these  expressions. 

2.  Development  of  the  integral  equation 

The  configuration  to  be  analyzed  is  shown  in  Figure  1  where  Sa  is  the  region  occupied  by  the 
material  body,  Sg  is  the  source  region,  and  Sb  is  the  region  in  free  space  exterior  to  both  the  wedge 
and  5a.  The  contour  which  surrounds  Sa  has  three  segments:  La  is  the  segment  which  borders 
on  Sb,  and  and  segments  which  border  on  the  wedge  faces  1  and  2,  respectively. 

Since  the  material  body  may  be  inhomogeneous,  the  permittivity  and  permeability  are  allowed  to 
be  functions  of  position  within  5a.  An  integral  equation  is  then  formulated  which  utilizes  a  single 
field  component  as  the  unknown  variable  to  be  solved  for.  This  method  of  formulating  an  integral 
equation  was  developed  by  Jin,  et.  al.  [5].  The  formulation  that  they  developed,  as  applied  to  the 
two-dimensional  problem  used  in  this  paper,  involves  a  surface  integral  over  the  material  region 
and  a  line  integral  along  the  contour  enclosing  that  region  where  the  unknown  quantity  to  solve 
for  is  either  the  z-component  of  the  electric  field  for  the  TM^  case  or  the  z-component  of  the 
magnetic  field  for  the  TE2  case. 

An  outline  of  this  method  for  formulating  the  integrcil  equation  in  terms  of  a  single  field 
component  is  as  follows:  1)  consider  the  region  exterior  to  the  material  body  Sb  and  use  Green’s 
second  identity  to  obtain  the  relationship  between  the  field  Uin  and  the  field  Ub  along  the  contour 
La\  2)  consider  the  region  interior  to  the  material  body  5a  and  use  the  divergence  theorem  to 
obtain  a  relationship  between  the  field  along  the  contour  La  and  the  internal  field  Ua’,  3)  use  the 
boundary  conditions  along  the  contour  La  to  relate  the  exterior  field  to  the  interior  field,  therefore 
obtaining  a  relationship  between  the  field  U,„  and  the  internal  field  Ua-  The  field  Uin  is  the  field 
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due  to  the  impressed  sources  at  Sg  (see  Figure  1)  in  the  presence  of  the  wedge  with  the  material 
removed,  Ua  is  the  total  field  inside  the  material  body  Sa  and  Ub  is  the  total  outside  Sa- 

Certain  terms  will  be  introduced  in  order  to  simplify  the  notation.  The  term  u  is  defined  as 

sini/i,2  =  for  TM^  :  sinz/1,2  =  2'i_2/7?;  for  TE^  (1) 


where  77  is  the  intrinsic  impedance  of  free  space.  The  two-dimensional  Green’s  function  for  the 
impedance  wedge  is  denoted  by  p/),  which  represents  the  field  at  the  observation  point  p 

due  to  a  line  source  at  p/.  The  terms  r  and  a  represent  the  distribution  of  the  material  parameters 
within  the  body. 


r{p)  = 


ea(^/eo;  TM;,, 


7(^  = 


eo/ea(^;  TE2 


(2) 


TE^, 

The  resulting  integral  equation  for  the  unknown  field  Ua  within  the  material  region  is 

La 

2  , 

-f  Y^ikoSlnu^  /  Gu,(p,p/)t/a(p0(^(^0  “ 

4 

+  I  Ua{pf)  •  VtrGa^iP,  pf)  +  klG^{p.  pf){r{pf)  -  (T{pl))]  dsf 

Ub{^’,  p  e  Sb, 

0-i^Ua{^]  pe  Sa  01  P  e  (3) 

lUaim+crm  P^La, 

where  the  above  integrals  are  principle  value  integrals  when  p  6  5a  or  Ld^  2  Vt^  =  “ 

Once  Ua  is  found,  Ub  can  also  be  computed  as  indicated  above. 

An  MM  technique  is  used  to  obtain  a  solution  to  the  integral  equation  given  in  (3).  To 
implement  this  technique,  the  material  region  Sa  is  subdivided  into  trapezoidal  cells  and  the 
bordering  contour  (ia,  ij/, ,  and  Ld^)  is  subdivided  into  linear  segments.  Pulse  basis  functions  are 
used  over  each  of  the  cells  and  point  matching  is  applied  at  the  centroid  of  the  cell,  and  pulse 
basis  functions  are  also  used  for  the  Hne  segments  and  point  matching  is  applied  at  the  center  of 
each  segment. 


3.  Numerical  results 


Far  field  scattering  patterns  showing  the  interaction  of  a  material  body  and  an  impedance  wedge 
have  been  generated.  To  verify  these  results,  the  limiting  case  when  the  impedance  wedge  becomes 
a  perfectly  conducting  semi-infinite  half-plane  was  used  so  that  a  comparison  could  be  made  to 
the  results  obtained  using  a  code  that  was  developed  by  Newman  which  uses  a  MM/Green’s 
function  technique  to  solve  for  the  scattering  from  a  complex  material  body  in  the  presence  of  a 
PEC  half- plane  [6].  The  agreement  with  the  reference  solution  was  found  to  be  excellent  which 
provides  a  strong  validation  for  the  formulation  of  the  integral  equation  since  it  is  a  completely 
different  one  from  that  which  is  used  in  the  reference  solution,  and  also  verifies  the  numerical 
implementation  of  the  MM  solution. 

A  material  body  will  now  be  attached  to  the  edge  of  a  wedge  for  the  purpose  of  reducing 
its  scattering  characteristics.  A  wedge  was  chosen  with  an  interior  wedge  angle  of  72°  and  has 
impedance  faces  with  the  surface  impedance  values  Z\  =  Z2  =  300  -|-  i30fi/sq.  The  configuration 
for  this  material  body  is  shown  in  Figure  2.  The  general  shape  of  this  body  was  chosen  for  what 


Figure  2;  Configuration  for  an  inhomogeneous,  lossy  materied  body  attached  to  a  wedge  for  the 
purpose  of  reducing  its  scattering  characteristics. 

would  appear  to  be  a  good  shape  for  reducing  scattering  and  one  that  could  easily  conform  to  the 
edge  of  the  wedge.  A  lossy  dielectric  was  used  where  the  material  parameters  were  carefully  chosen 
so  that  they  would  be  consistent  with  what  is  physically  realizable  with  carbon  loaded  microwave 
absorber  materials.  The  relative  permeability  is  fir  =  I  and  two  different  distributions  were  used 
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for  the  relative  permittivity;  one  which  is  homogeneous  and  the  other  which  has  an  exponential 
distribution  in  p.  For  the  homogeneous  case  the  relative  permittivity  is  given  by  Cp  =  2  -|-  and 
the  exponential  distribution  is  given  by  Cr  =  1  +  (1  +  za)e”*’^,  where  p  is  in  wavelengths  and  e'',  a, 
and  6,  are  real,  positive  parameters. 

A  procedure  will  now  be  introduced  which  will  use  numerical  optimization  to  provide  a  means 
to  design  the  distribution  for  the  material  parameters.  For  this  procedure,  the  coefficients  e", 
a,  and  h  are  the  unknown  quantities  to  be  determined.  A  cost  function  is  defined  which  in¬ 
volves  determining  the  scattered  field  over  a  specified  spatial  region,  and  an  iterative  numerical 
optimization  routine  is  then  used  to  minimize  the  cost  function.  The  cost  function  is  defined  as 

<h 

F(S)  =  j (4) 

<^1 

where,  in  this  paper,  is  referred  to  as  the  “scattered”  field  (which  is  defined  as  the  total  field 
minus  the  incident  field  and  the  reflected  fields  from  the  wedge  faces),  <}>  is  the  observation  angle,  (j>' 
is  the  incidence  angle,  is  a  vector  consisting  of  the  unknown  coefficients  for  the  material  profile, 
and  the  angular  sector  for  which  the  cost  function  is  minimized  over  is  given  by  <  ^  <  <f)2.  An 
iterative  numerical  technique  is  then  employed  to  minimize  this  cost  function.  For  this  application, 
it  was  decided  to  use  a  conjugate  gradient  algorithm.  An  initial  guess  for  the  coefficients  is 
entered  and  the  routine  then  iterates;  each  time  incrementing  the  coefficients  in  such  a  way  so 
as  to  minimize  the  gradient  of  the  cost  function.  When  it  is  felt  that  the  routine  has  sufficiently 
converged  to  a  minimum  value  for  the  cost  function,  the  process  is  then  terminated;  yielding  the 
optimum  values  for  the  coefficients  of  the  material  distribution. 

For  the  following  examples,  the  bistatic  case  was  considered  where  the  incidence  angle  is 
<^'  =  180°  and  the  specified  region  to  minimize  the  scattered  field  is  135°  <  4>  <  225°.  A  lossy 
dielectric  material  body,  where  the  relative  dielectric  constant  could  be  either  homogeneous  or  have 
an  exponential  variation  in  p,  was  studied.  The  results  for  the  TM^  case  are  shown  in  Figure  3(a). 
Optimization  was  used  to  obtain  the  material  parameters  er=2-i-i0.676  for  the  homogeneous  case, 
and  a=0.669  and  6=4.262  for  the  exponential  variation.  The  exponential  distribution  provides  a 
respectable  reduction  of  about  25dB  around  endfire  for  the  impedance  wedge,  as  opposed  to  only 
about  6dB  for  the  homogeneous  material  body.  Results  for  the  TE^  case  are  shown  in  Figure  3(b) 
where  optimization  was  used  to  obtain  the  material  parameters  6^=2-1-10.53  for  the  homogeneous 
case,  and  a=7.04  and  6=114.6  for  the  exponential  variation.  The  homogeneous  material  body 
actually  increases  the  scattering  from  the  wedge,  which  would  indicate  that  it  somehow  interferes 
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Figure  3:  Bistatic  echo  width  plots  for  the  TM^  case  (a)  and  the  TE^  case  (b)  with  <f>'  =  180°  for 
the  configuration  of  a  lossy  material  body  attached  to  an  impedance  wedge  shown  in  Figure  2. 
For  the  TM^  case  material  body  1  is  homogeneous  with  =  2  +  tO.676,  and  material  body  2 
has  an  exponential  distribution  in  p  with  er{p  =  0)  =  2  +  iO.669  (at  the  edge  of  the  wedge)  and 
er{p  =  .25)  =  1.34  +  t0.23  (at  the  tip  of  the  material  body);  and  for  the  TE^  case  material  body  1 
is  homogeneous  with  er  =  2  +  iO.53,  and  material  body  2  has  an  exponential  distribution  in  p  with 
er{p  =  0)  =  2  + 17.04  (at  the  edge  of  the  wedge)  and  er{p  =  .25)  =  1  (at  the  tip  of  the  material 
body). 
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with  the  natural  scattering  reduction  properties  of  the  impedance  surface.  As  for  the  exponential 
distribution,  the  exceedingly  large  value  for  b  causes  the  relative  dielectric  constant  to  decay  very 
rapidly  to  1  which  would  essentially  concentrate  the  lossy  material  to  a  very  small  region  about 
the  edge  and  reducing  the  effectiveness  of  the  rest  of  the  body.  This  distribution  only  provides  a 
slight  reduction  in  scattering. 

4,  Conclusions 

A  method  for  analyzing  the  EM  scattering  from  a  two-dimensional  complex  material  cylinder  in 
the  presence  of  an  impedance  wedge  has  been  presented  in  this  paper.  A  lossy  dielectric  material 
body  was  chosen  and  different  profiles  for  the  material  inhomogeneities  were  considered.  Numer¬ 
ical  optimization  was  used  to  design  the  distributions  for  the  material  parameters.  The  results 
showed  reasonable  levels  of  scattering  reduction  are  achievable  with  this  type  of  configuration. 
The  dielectric  material  parameters  that  were  considered  in  this  study  were  kept  consistent  with 
those  that  are  physically  realizable  by  typical  carbon  loaded  microwave  absorber  materials.  The 
application  of  a  lossy  material  body  to  the  edge  of  a  wedge  is  an  effective  technique  for  reducing 
the  electromagnetic  diffraction  from  the  edge. 
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I  Introduction 

Closed  form,  asymptotic  high  frequency  expressions  have  been  obtained  for  describing  the  reflec¬ 
tion  and  diffraction  of  a  general  electromagnetic  (EM)  astigmatic  Gaussian  beam  (GB)  which  is 
incident  on  a  curved,  perfectly  conducting  screen  or  reflector  surface  containing  an  edge.  The 
incident  GB  can  be  obliquely  incident  on  the  curved  edge  of  the  reflector.  In  the  present  develop¬ 
ment,  the  edge  of  the  reflector  is  required  to  lie  in  a  plane  which  is  oriented  in  a  specific  manner 
as  explained  below.  Furthermore,  it  is  assumed  that  at  each  point  of  reflection  where  the  axis 
of  the  incident  GB  intersects  the  actual  reflecting  surface  (or  its  extension  if  the  incident  GB 
axis  misses  to  intersect  the  reflector),  the  actual  surface  can  be  approximated  loc^y  about  that 
point  by  a  second  degree  surface  (or  a  paraboloid)  with  the  same  principal  radii  of  curvature 
as  the  original  surface  at  that  point.  This  assumption  is  consistent  with  the  usual  asymptotic 
high  frequency  approximations  based  on  the  local  properties  of  high  frequency  fields.  Also,  the 
surface  radii  of  curvature  are  assumed  to  be  sufficiently  large  and  slowly  varying  for  the  high 
frequency  assumptions  to  be  valid.  Specifically,  the  plane  containing  the  edge  of  the  reflecting 
surface  is  chosen  to  be  parallel  to  the  surface  normal  at  the  apex  of  the  local  paraboloidal  surface. 
Recently,  this  condition  has  been  relaxed  so  that  the  plane  containing  the  edge  c^  be  oriented 
relatively  arbitrarily  with  respect  to  the  reflecting  surface;  this  more  recent  work  wiU  be  reported 
separately  in  the  near  future  with  applications  of  GB’s  to  the  analysis  and  even  synthesis  of  large, 
offset  fed  reflector  antennas.  The  present  analysis  is  based  on  the  use  of  the  physical  optics  (PO) 
formulation  which  is  valid  for  electrically  large  reflecting  surfaces.  The  PO  integral  for  the  field 
scattered  by  the  reflector,  when  it  is  illuminated  by  an  incident,  general  astigmatic  GB,  is  eval¬ 
uated  in  closed  form  after  introducing  the  Fresnel  approximation  within  the  integrand.  For  a 
reflecting  surface  with  no  edges,  only  a  reflected  field  is  predicted  within  this  formulation;  the 
leading  term  in  this  case  can  be  shown  within  its  paraxial  region  to  behave  as  a  reflected  general 
astigmatic  GB,  whereas  the  additional  terms  include  the  effects  of  beam  distortion  not  contained 
in  the  leading  term.  In  practical  applications,  such  as  in  the  analysis /synthesis  of  reflector  an¬ 
tennas,  the  reflecting  surface  contains  an  edge;  in  such  cases,  the  incident  GB  can  also  undergo 
diffraction  by  the  edge.  However,  since  the  incident  GB  decays  exponentially  transverse  to  the 
beam  axis,  the  effects  of  edge  diffraction  become  negligible  if  the  axis  of  the  incident  GB  is  not 

"This  work  reported  here  was  supported  in  part  by  the  Joint  Services  Electronics  Program,  under  contract 
N00014-89-J-1007. 
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very  close  to  the  edge  of  the  reflector.  The  reflection  of  a  general,  EM  astigmatic  GB  incident 
on  a  smooth  curved  perfectly  conducting  surface  is  discussed  first;  that  discussion  is  then  sub¬ 
sequently  followed  by  one  on  composite  reflection  and  diffraction  effects  which  axise  when  the 
incident  beam  axis  is  very  close  to  an  edge  in  the  reflecting  boundary.  When  the  incident  beam 
axis  strikes  the  reflector  such  that  it  is  not  very  close  to  the  edge,  then  only  surface  reflection 
effects  dominate  and  the  effects  of  edge  diffraction  can  be  ignored. 


II  GB  Reflection 

Let  a  general,  EM  astigmatic  GB  be  incident  on  a  smooth  perfectly-conducting  surface  as  shown 
in  Figure  1.  Let  the  incident  GB  axis  (along  the  i*  direction)  intersect  the  reflecting  surface  at 
0  which  now  constitutes  the  origin  of  the  surface  coordinate  system  {x,y,z)  in  which  z  is  the 
unit  outward  normal  to  the  surface  at  O.  The  surface  at  and  near  0  can  be  approximated  by  a 
local  paraboloid  as: 


1 


Ri  R2 


(1) 


Where  Ri  and  R2  are  the  local  principal  surface  radii  of  curvature  of  the  original  surface  at  0 
along  the  x  and  y  directions,  respectively.  The  magnetic  field  associated  with  this  incident 
GB  is  given  as: 


with 


^ Vi,  Zi)  =  [xiHi  +  yiH2]  B{xi,  yi,  Zi) 

B{xi,y„Zi)  = 

+  p\  ^  2*  +  p\ 


(2) 


in  which  p\  and  p^  given  by: 


P\  =  Pii-^jb{ 

P2  =  pu+j^i 


and  also 


-1  “  9(^11  "b  ^22)  i  +  ^22)^  ”  '^(Qii^22  —  ^12^21)* 


P\a 


(3) 

(4) 

(5) 

(6) 


The  and  p‘2  represent  the  principal  radii  of  curvatures  of  the  incident  GB  wavefront  at  O, 
and  the  b\  as  well  as  b),  are  related  to  the  incident  spot  sizes  at  the  waists: 


b{  = 


ku}\ 


,  =  ^ 


(7) 

(8) 
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Finally,  the  Cn,  C12,  C21  and  C22  relate  the  coordinates  {x%y\z^)  fixed  in  the  incident  GB  to  the 
surface  coordinates  {x,y,z)  at  0.  For  convenience,  the  origin  of  (a:‘,y*,z*)  is  also  chosen  to  be 
at  0  with  being  the  direction  of  axial  propagation  of  the  incident  GB  and  (x*,y‘)  being  X  i'; 
thus, 


(9) 


Under  the  PO  approximation,  the  reflected  magnetic  field,  can  be  shown  to  be  expressed  in 
terms  of  the  PO  integral  by: 


■  Xi  ■ 

Cll  C12  Ci3 

X 

Vi 

= 

C21  C22  C23 

y 

.  . 

.  C31  C32  C33 

z 

-jkR 


2t 


(10) 


in  which  {x',y',z')  are  the  coordinates  at  any  point  on  the  reflecting  surface  (with  z'  = 
{x,y,z)  are  the  coordinates  of  the  observation  point  off  the  surface,  and 
R  is  given  as  usual  by: 


R  =  ^{x-x’f  +  {y-y'f  +  (z-z'y. 

Also,  W  in  (10)  is  given  approximately  in  Fresnel  region  by 

W(x',y')  =  ^my-v')-Vy(z-z')]  +  ^l%{z-z')-V,{x-x')] 


(11) 


~  "  I'")! 


(12) 


with 


V(x',y'}  =  x 


-Wz  + 

K2 


+  y 


-  — PV3 

iti 


+  z 


and 


=  xVy;  +  yVy  +  zVz 

(13) 

Wt  = 

-fflCll  +  -ff2C21 

(14) 

W2  = 

H1C12  +  H2C22 

(15) 

W3  - 

•fflClS  +  S2C2Z 

(16) 

After  introducing  the  proper  Fresnel  approximation  in  the  phase  terms  within  the  integrand  of 
(10)  and  replacing  J?  by  r  in  the  amplitude  terms,  one  obtains  [1] 


H''{x,y,z) 


jk 

27r  r 


[H"oo/oo  +  W'io/io  +  IVoi/oi’ 


(IT) 
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Ill  GB  Diffraction 


When  the  reflecting  surface  contains  an  edge,  the  scattered  magnetic  field  contains  composite 
surface  reflection  and  edge  diffraction  effects.  Consider  the  special  edge  configuration  in  Figure  2 
in  which  case  the  PO  integral  for  the  scattered  field  is  given  by  (22)  but  with  the  limits  of 
integration  on  x'  being  from  — oo  to  Xg  instead  of  — oo  to  +oo.  Thus,  instead  of  (22)  one  now 
obtains 

H’{x,y,z)  =  +  +  (32) 

with 

li,  =  /ooi(l  +  erf(2:,)|  (33) 

in  which  the  argument  zj  of  the  error  function  is 


and 
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IV  Numerical  Results  and  Discussion 

In  Figs.  3-5,  the  incident  beam  axis  is  normal  to  the  surface  at  the  origin  and  the  principal 
beam  axes  line  up  with  the  principal  surface  directions.  The  edge  extends  past  the  reflection 
point  in  Figure  3  and  the  scattering  mechanism  is  predominantly  reflection.  In  Figure  4  the 
edge  of  the  surface  coincides  exactly  with  the  beam  reflection  point  and  the  scattering  is  almost 
equally  due  to  reflection  and  diffraction.  The  incident  beam  axis  actually  misses  the  surface  in 
Figure  5;  in  this  case  the  scattering  is  mostly  due  to  the  diffraction  of  the  incident  beam  tail. 
Figure  6  illustrates  a  case  where  the  principal  beam  axes  are  rotated  45  degrees  with  respect  to 
the  principal  surface  directions  and  in  Figure  7  the  incident  beam  direction  makes  an  angle  of 
20  degrees  with  the  z  axis  in  the  x-z  plane.  In  all  cases  the  agreement  with  the  numerical  PO 
integration  is  excellent.  Figure  8  illustrates  the  presence  of  the  Keller  cone  phenomena  for  beam 
diffraction. 

This  new  GB  technique  can  be  employed  together  with  an  iterative  algorithm  to  efficiently 
optimize  the  synthesis  of  reflector  antennas  for  generating  spot,  multiple  and  contour  beams  in 
applications  as  satellite  communications  antennas.  Such  an  approach  would  provide  a  useful  and 
efficient  analysis/ design  tool  for  reflector  antennas  [2,  3].  This  GB  technique  is  expected  to  be 
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very  fast  in  comparison  to  conventional  numerical  PO  based  techniques  which  may  take  many 
hours  of  computer  time  when  doing  a  reflector  synthesis  problem.  In  the  proposed  procedure,  the 
feed  array  distribution  for  illuminating  a  parabolic  reflector  may  be  synthesized  for  generating 
multiple  or  contoured  beams  by  expanding  the  feed  radiation  into  a  set  of  GB’s  with  unknown 
amplitudes  which  then  can  be  found  by  matching  the  resulting  reflected  and  diffracted  fields, 
produced  by  the  reflector  illuminated  by  the  GB’s,  to  the  desired  reflector  radiation  pattern. 
The  iterative  optimization  algorithm  may  be  used  to  find  the  optimal  beam  amplitudes,  which 
also  directly  provide  the  corresponding  fields  at  their  launching  points  at  the  feed;  i.e.,  they 
provide  directly  the  optimal  feed  pattern  and/or  feed  aperture  distribution.  In  the  case  of 
shaped  reflector  synthesis  for  producing  a  contoured  beam  with  a  given  feed  antenna,  the  given 
feed  radiation  is  expanded  into  a  set  of  GB’s  with  known  amplitudes  because  the  feed  pattern 
is  known  in  this  case.  The  shape  of  the  reflector  can  be  determined  by  repeatedly  but  very 
efficiently  bouncing  (reflecting)  and  diffracting  the  GB’s  incident  from  the  feed  at  each  iteration 
in  the  optimization  algorithm  where  the  reflector  shape  is  changed  appropriately  in  iterative 
steps  until  the  reflected/diffracted  GB’s  produce  the  desired  contoured  beam. 
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Figure  1:  3-D  Gaussian  beam  reflection  from  a  Figure  2:  3-D  Gaussian  beam  scattering  from  a 
curved  surface.  curved  edge. 
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Figure  4;  3-D  Gaussian  beam  scattering  from 
a  curved  edge.  Beam  reflection  point  is  on  the 
edge.  Incident  beam  and  surface  parameters: 

=  50A,ii:2  =  60A,a:e  =  =  15A,6‘  = 

20X,uj{  =  2.19A,£.;‘  =  2.52X,pi^  =  30A,p*2  = 
25A,0i  =  0. 


Scon  Angle  6  (deg) 


Figure  3:  3-D  Gaussian  beam  scattering  from 
a  curved  edge.  Beam  reflection  point  is  on  the 
surface.  Incident  beam  and  surface  parameters: 

=  50X,R2  =  60A,a:e  =  4:X,h\  =  15A,6i  = 
20A,a;j  =  2.19A,^  =  2.52A,/o*i  =  30A,p*2  = 
25A,  6i  =  0. 


Scon  Angle  8  (deg) 


Figure  5;  3-D  Gaussian  beam  scattering  from 
a  curved  edge.  Beam  reflection  point  is  off  the 
surface.  Incident  beam  and  surface  parameters: 
Rx  =  m,R2  =  eOX,x^  =  -a,h\  =  15A,6‘  = 
20A,  ujJ  =  2.19A,a;2  =  2.52X,p^^i  =  30A,p*2  = 
25A,6ii  =  0. 


123 


27 


- Reference 

- General 

Analytic  Resul' 


-90  -60  -30  0  30  60  90 

Scon  Angle  6  (deg) 

Figure  6:  3-D  Gaussian  beam  scattering  from  a 
curved  edge.  Incident  beam  and  surface  param¬ 
eters:  Ri  —  — SOAji^a  =  — 40A,a:e  =  — 

20X,bi  =  40A,a;j  =  2.52A,a;‘  =  3.57A,p‘i  = 
25A,  p‘2  =  30A,  e,  =  0. 


Scon  Angle  8  (deg) 


Figure  7:  3-D  Gaussian  beam  scattering  from  a 
curved  edge.  Incident  beam  and  surface  param¬ 
eters:  Ri  =  ~50X,R2  =  — 60A,Xe  =  = 

lOA,^  =  40A,a>‘  =  1.78A,^  =  3.57A,p‘i  = 
25A,p^2  =  20  A,  0,  =  20°. 


Figure  8:  3-D  Gaussian  beam  scattering  from  a 
curved  edge.  Plot  of  total  field  illustrating  the 
Keller  cone.  Scanning  is  in  a  plajie  containing 
the  edge.  Incident  beam  and  surface  param¬ 
eters:  Ri  —  —60X,R2  =  — 50A,Xe  =  0,&i  = 

lOA,^  =  30A,a;j  =  1.78A,^  =  3.09A,p’i  = 
25A,p‘2  =2OA,0i  =  2O°. 
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POLARIZED  SCATTERED  LIGHT  BY  A  SEMICYLINDRICAL  BOSS  ON  A 
CONDUCTING  FLAT  PLANE 


Hashim  A.  Yousif 
Natural  Sciences  Division 
University  of  Pittsburgh  at  Bradford 
Bradford,  Pa  16701 


ABSTRACT 

A  brief  description  of  the  analytical  solution  for  the  scattering  of  electromagnetic  plane  waves 
by  a  semicylindrical  boss  of  finite  conductivity  on  a  perfectly  conducting  flat  surface  is  introduced.  Nu¬ 
merical  calculations  for  the  angular  distribution  of  the  scattered  intensity  and  the  Mueller  scattering  ma¬ 
trix  elements  (S..)  in  the  far  field  scattering  region  are  presented.  The  TM  and  TE  polarizations  of  the 
incident  wave  are  considered.  These  calculations  are  compared  with  those  for  a  lone  cylinder.  No  simi¬ 
larities  are  noted  between  the  scattering  pattern  of  a  boss  and  that  of  a  lone  cylinder  with  the  same  Com¬ 
position  and  radius. 

INTRODUCTION 

The  analytical  solution  of  the  scattering  of  light  by  a  perfectly  conducting  boss  on  a  perfectly  conducting 
surface  was  investigated  by  Lord  Rayleigh'  at  the  turn  of  this  century.  Since  then  many  have  extended  or 
generalized  the  solution  to  include  the  general  case  of  a  boss  with  finite  conductivity.^  Numerical  calculations 
for  the  perfectly  conducting  boss  are  limited  and  nonexistent  for  the  general  case.  In  addition,  the  analytical 
solution  has  not  been  formulated  in  terms  of  the  Stokes  vectors  and  the  Mueller  scattering  matrix  elements 
to  the  best  knowledge  of  this  author.  These  elements  are  of  great  theoretical  and  experimental  importance 
because  they  measure  the  influence  of  the  scatterer  on  the  state  of  polarization  of  the  incident  light.  There 
are  only  8  non-zero  elements  for  the  boss  problem. 

This  problem  is  important  because  it  serves  as  a  simplified  model  to  study  the  scattering  of  light 
from  a  rough  or  imperfect  surface.  Clearly  if  the  radius  of  the  boss  is  zero,  there  will  be  no  scattering  because 
perfectly  flat  surfaces  do  not  scatter;  they  only  reflect.  However,  if  the  radius  of  the  boss  is  not  zero,  the 
scattering  of  light  from  surface  may  be  used  as  a  tool  to  monitor  and  detect  imperfections,  roughness,  or 
defects  in  that  surface.  This  model  helps  achieving  the  previous  goal  by  showing  that  light  scattering  signals 
are  sensitive  to  the  change  in  the  radius  and  the  material  of  the  boss. 

THE  ANALYTICAL  SOLUTION  DESCRIPTION 

A  semicylindical  boss  of  radius  a  on  a  flat  perfectly  conducting  surface  is  shown  in  figure  1.  The 
axis  of  the  boss  is  parallel  to  the  z  axis  of  the  cylindrical  coordinate  system  (  P  The  permittivity, 

conductivity,  and  permeability  of  the  boss  are  ,  and  jj^,  respectively.  The  medium  outside  the  boss 

is  non-absorbing  (  Oq=  0)  having  permittivity  €q  and  permeability  .  The  incident  ray  makes  an  angle 
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({)Qwith  the  x-axis. 

The  analytical  solution 
of  this  problem  is  similar  to 
that  of  a  lone  cylinder,  with 
the  exception  that  the  incident 
wave  is  no  longer  a  simple 
plane  wave  but  a  standing 
wave. 

A.  The  incident  TM  wave. 

For  this  case,  the  electric  field 
vector  of  the  incident  plane 
wave  is  parallel  to  the  axis  of 
the  boss.  The  analytical  solu¬ 
tion  can  be  obtained  by  em¬ 
ploying  the  image  method. 

According  to  this  method,  the 
boss  can  be  replaced  by  a  lone 
cylinder  and  the  effective  incident  field  is  composed  of  the  actual  incident  field  on  the  boss  and  the  field  reflected 
by  the  flat  plane.  The  effective  incident  field  for  the  exp(/cof)time  dependence  is: 

=£'o[exp07:Qp(4)-4)o)+/?exp(t7:Qpcos(4)+4)Q)],  (1) 

where  is  the  amplitude  of  the  incident  wave,  R  is  the  reflection  coefficient,  whose  value  is  - 1  for  this 
polarization,  is  the  wave  number  of  the  medium  outside  the  boss. 

The  z  component  of  the  scattered  electric  field  satisfies  the  scalar  wave  equation  and  can  be  written  as  a 
series  solution  of  the  Hankel  functions  and  the  scattering  angle.  Similarly  the  z  component  of  the  electric 
field  inside  the  boss  satisfies  the  wave  equation  and  can  be  written  as  a  series  solution  too.  By  applying  the 
boundary  conditions  {E^  ,  are  continuous  at  the  surface  of  the  boss),  expressions  for  the  scattering 
coefficients  can  be  obtained.  These  coefficients  are  functions  of  the  Bessel  and  Hankel  functions  and  their 
derivatives  and  the  electric  properties  of  the  boss  and  the  outside  medium.  The  wave  number  of  the  boss 
can  be  written: 


From  which  the  complex  index  of  refraction  of  the  boss  can  be  obtained 

K 

^h=  — 

^  k 

B.  The  TE  Incident  wave.  For  this  case  the  effective  incident  magnetic  field  vector  is  parallel  to  the  axis 
of  the  boss.  The  expression  for  the  incident  magnetic  field  is  the  same  as  the  field  given  by  equation  (1) 
with  £'Qis  replaced  by  the  amplitude  of  the  incident  magnetic  field.  In  addition,  the  value  of  the  reflection 
coefficient  R  is  one  for  this  case. 

The  scattering  coefficient  for  the  TE  case  can  be  found  from  the  TM  coefficients  by  the  use  of  the  symmetry. 
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THE  MUELLER  SCATTERING  MATRIX  ELEMENTS 


The  expressions  for  the  scattering  amplitudes  at  the  far  field  region  can  be  obtained  by  replacing 
the  Hankel  functions  with  their  asymptotic  expressions.  The  Mueller  scattering  matrix  elements  are  found 
from  the  scattering  amplitudes.  There  are  only  8  non- vanishing  elements  of  the  Mueller  scattering  matrix 
for  the  boss  problem.  In  the  limit  [  |  ,  these  elements  relate  the  Stokes  parameters  of  the  scattered 

field  (/  •^,(2  ■*,  to  the  parameters  of  the  incident  field  by  ^ 


In  writing  the  above  equation,  we  have  used  the  fact  that  not  all  the  Mueller  scattering  matrix  elements  are 
independent.  The  Mueller  scattering  matrix  elements  describe  how  the  scatterer  treats  the  various  states 
of  polarization  of  the  incident  radiation.  In  other  words,  these  elements  can  be  used  to  predict  the  state  of 
polarization  of  the  scattered  light  if  the  state  of  the  incident  radiation  is  given.  These  elements  are  measurable 
quantities.  They  can  be  described  briefly  as: 

1)  5jj(4>)  represents  the  intensity  of  the  scattered  light  if  the  incident  intensity  is  unpolarized.  In 
general,  if  the  incident  field  is  unpolarized,  the  scattered  light  is  partially  unpolarized. 

2)  ■S’j2((|>)  is  the  difference  of  the  scattered  intensities  for  the  incident  electric  vector  parallel  and 
perpendicular  to  the  axis  of  the  boss.  If  the  boss  treats  these  two  polarizations  equally,  this  element  would 
be  zero.  However,  for  our  case,  if  the  electric  vector  of  the  incident  wave  is  parallel  to  the  axis  of  the  cylinder, 
the  electric  vector  of  the  scattered  wave  is  also  parallel  to  the  cylinder  axis.  This  holds  true  for  the  other 
polarization.  In  short,  the  scattered  and  the  incident  beams  have  the  same  polarizations. 

3)  *S'33((j))  is  determined  by  the  four  combinations  of  illuminating  the  scatterer  with  linearly  polarized 
beam  makes  an  angle  of  ±45°  with  the  axis  of  the  boss  and  calculating  the  ±45  linearly  polarized  scattered 
radiation.  It  is  important  to  note  that  if  the  incident  light  is  linearly  polarized  at  T 45°  with  the  axis  of  the 
boss,  thescattered  radiation  is  elliptically  polarized. 

4)  534((j))is  similar  to  5'33((|))  except  the  incident  and  the  scattered  beams  are  left  or  right  hand 
circularly  polarized.  Also,  four  combinations  of  right  and  left  handed  circular  polarizations  are  needed  in 
this  kind  of  calculation.  In  general,  if  the  incident  light  is  circularly  polarized,  the  scattered  beam  is  elliptically 
polarized. 

It  is  important  to  note  that  the  four  elements  are  not  independent.  In  fact  they  are  related  to  each 
other  by  the  following  relation 


(S„((l»/5„((j)))2HS33(<t>)/S„((t>))'+53,(<t))/5„(4)))2=l. 


Therefore,  there  are  only  three  independent  elements  for  the  boss  problem.  This  is  the  same  number  as  for 
a  lone  cylinder.  The  interested  reader  should  consult  the  paper  by  Bickel.'^  That  paper  contains  information 
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about  determining  all  the  sixteen  elements  of  the  Mueller  scattering  matrix  experimentally. 

COMPUTER  PROGRAM 


A  Fortran  computer  program  was  developed  to  generate  numerical  calculations  for  the  far  field  scattered 
intensities  and  the  Mueller  scattering  matrix  elements.  The  program  first  calculates  Bessel  functions  and 
their  various  derivatives.  These  functions  are  used  to  compute  the  scattering  coefficients.  Once  these  coefficients 
are  calculated,  the  Mueller  scattering  matrix  elements  and  the  scattered  intensities  are  computed  using  a 
series  solution.  The  input  parameters  of  the  program  are:  a)  the  radius  of  the  boss;  b)  the  refractive  indices 
and  the  permeabilities  of  the  boss 
and  the  outside  medium;  c)  the 
wavelength  of  the  incident  radiation, 
and  d)  the  angle  of  incidence.  The 
program  has  been  tested  for  wide 
range  of  the  input  parameters  and 
gives  accurate  results. 

NUMERICAL 
CALCULATIONS 


We  have  generated  some 
numerical  calculations  for  the 
angular  distribution  of  the  4  non¬ 
vanishing  elements  of  the  Mueller 
scattering  matrix  and  for  the  far  field 
scattered  intensities,  /j  is  the 
scattered  intensity  when  the  electric 
vector  of  the  incident  wave  is 
parallel  to  the  axis  of  the  boss  (TM) 
and  I2  is  scattered  intensity  when 
the  incident  magnetic  vector  is 
parallel  to  that  axis  (the  incident 
electric  vector  is  perpendicular  to 
the  axis  of  the  boss).  We  plotted 
the  normalized  /j  ,4,  and  S'jj  as 
functions  of  the  scattering  angle. 

We  have  normalized  the  previous 
quantities  by  dividing  each  its 
maximum  value.  Therefore,  each 
of  these  quantities  is  bounded 
between  0  and  1.  The  other 
elements  (iS'j2  ’  *^33  ’  *^34^  are  normalized  by  dividing  each  of  them  by  ‘S'j  j(4))  at  any  angle.  The  value 
of  each  of  these  elen>ents  is  always  between  -1  and  +1.  The  calculations  were  made  for  various  values  of 
the  refractive  index,  the  radius  of  the  boss,  and  the  angle  of  incidence  (  (()q).  The  medium  outside  the  boss 
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Figure.  2.  S-^  and  the  normalized  scattered  intensities  (/j  ,4)  as 
functions  of  the  scattering  angle  ((f)) .  The  refractive  index  {N^) 
and  size  parameter  of  the  boss  (solid  line)  are  same  as  the 

lone  cylinder  (dashed  line).  /:„<2=3.14,  M=1.6+  fO.Oland 
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is  assumed  to  be  free  space. 

Figure  2  shows  and  the  far  field  scattered  intensities  for  a  boss  (solid  line)  and  a  lone  cylinder 

(dashed  line)  as  functions  of  the  scattering  angle.  The  refractive  index  and  the  radius  of  the  boss  are  the 
same  as  these  of  a  lone  cylinder.  The  size  parameter  ( k^a )  and  the  refractive  index  are  3. 14  and  1 .6-iO.Ol , 
respectively,  and  <t)Q  =90®  (the  incident  ray  is  perpendicular  to  the  flat  surface).  The  scatterers  are  assumed 
to  be  non-nignetic.  As  can  be  seen  from  the  diagram,  the  scattering  patterns  of  the  boss  and  the  lone  cylinder 
are  different.  In  addition,  there  is  a  noticeable  difference  between  /j  and  I 2  of  the  boss;  however,  the  difference 
is  more  pronounced  for  the  cylinder.  The  scattering  patterns  of  the  boss  and  the  cylinder  are  symmetric 
with  respect  to  the  scattering  angle  4)=90®  as  they  should  be.  We  can  draw  the  following  important  general 
conclusions  regarding  the  Mueller  scattering  matrix  elements  and  the  scattered  intensities  for  the  boss  problem: 

a)  5,2(0)=5i2(180)=-1, 

b)  533(0)=533(i80)=0, 

c)  534(0)  =53,(180)=0, 

d)  /,(0)=/j(180)  =  0. 

These  are  not  surprising  results 
because  the  total  electric  field 
(incident+reflected  by  the  flat 
surface+scattered  by  the  boss) 
must  be  zero  at  4)  =  0® ,  180  ® . 

We  investigated  this  case  further 
by  varying  the  imaginary  part  of 
the  refractive  index  starting  with 
the  current  value  of  zero  for  this 
index.  Initially,  the  diffraction 
pattern  of  the  boss  shows  a 
noticeable  change  for  a  very  small 
change  in  this  index.  However, 
the  scattering  pattern  remains 
virtually  unchanged  if  the  index 
is  increased  beyond  20.  This  is 
again  not  surprising  since  the  boss 
becomes  totally  reflecting. 

We  have  studied  the 
influence  of  the  angle  of  incidence 
on  the  diffraction  pattern  of  a  boss 
and  a  lone  cylinder.  We  found  that 
this  parameter  has  a  great  effect 
especially  for  the  boss.  Figure 
3  shows S.J ,  /j , 4  as  functions  of 
the  scattering  angle.  The  size 
parameter  and  the  refractive  index  of  the  boss  are  1.5708  and  1.6,  respectively.  The  angle  of  incidence  is 
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Figure  3.  5^..  and  the  normalized  scattered  intensities  for  a  boss 
solid  line)  and  a  lone  cylinder  (dashed  line)  with  the  same 
:ompositions  and  radii.  The  size  parameter  and  the  refractive  index 
)f  the  boss  are  1.5708  and  1.6,  respectively.  The  angle  of  incidence 
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10°.  The  dashed  line  shows  the  calculations  for  the  lone.  It  is  importrant  to  note  that  as  the  angle  of  incidence 
is  decreased,  the  scattered  intensity  of  the  incident  TM  wave  (/j(<j)))  decreases  and  becomes  smoother. 


CONCLUSION 

We  have  generated  numerical  data  for  the  Mueller  scattering  matrix  elements  and  the  scattered  intensities 
as  functions  of  the  scattering  angle  for  various  values  of  angle  of  incidence,  refractive  index,  and  radius 
of  the  boss.  These  calculations  indicate  that  the  light  scattering  patterns  are  veiy  sensitive  to  small  changes 
in  the  radius  and  the  material  of  the  boss.  When  the  radius  of  the  boss  is  small  comparable  to  the  incident 
wavelength,  the  scattering  pattern  is  smooth.  As  the  radius  starts  to  grow,  the  pattern  begins  to  develop 
stmcture.  In  addition,  our  calculations  for  the  boss  are  compared  with  those  of  a  lone  cylinder.  No  similarities 
are  noted  between  the  scattered  signal  of  a  boss  and  that  of  a  cylinder.  This  is  unexpected  result  since  the 
analytical  solution  of  the  boss  is  very  similar  to  that  of  a  cylinder. 
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1.  Introduction. 

Atmospheric  ducts  give  an  opportunity  for  transhorizon  HF/UHF  radio  communication.  Due  to 
presence  of  atmospheric  ducts  the  distance  of  radio  communication  can  reach  2500  km.  The  facts 
of  radar  observation  at  such  distances  are  well  known.  In  many  regions  atmospheric  ducts  is 
regular  phenomena.  The  percent  incidence  of  ducting  conditions  in  many  regions  is  40%-  80%  [2], 
[3].  There  exists  a  rich  statistics  of  atmospheric  ducts  appearance  and  its  parameters.  To  be  used 
practically  in  precise  radar  or  high  quality  communication  systems  the  behavior  of  radio  wave 
under  different  conditions  in  atmospheric  ducts  must  be  thoroughly  investigated.  The  vertical 
structure  of  atmospheric  ducts  along  distance  is  not  stable.  Many  of  natural  processes  such  as 
different  types  of  acoustic  or  gravity  waves  can  periodically  modify  the  vertical  profile  of  refractive 
index  [2].  Similar  perturbations  of  atmospheric  ducts  are  very  essential  for  long  distance  radio 
wave  propagation  in  atmospheric  ducts.  Even  very  weak  perturbations  can  change  completely  the 
behavior  of  rays  in  duct.  Space  periodicity  in  refractive  index  profile  along  the  ray  path  can  cause 
fast  divergence  of  rays  in  atmospheric  ducts.  As  result  the  main  part  of  rays  in  duct  becomes 
mixed  and  the  structure  of  receiving  signal  is  distorted  significantly.  This  can  create  a  serious 
problems  for  the  work  of  radar  or  wireless  communication  systems.  From  other  side  the  weak 
perturbations  of  atmospheric  parameters  can  lead  to  creation  of  a  set  of  additional  subducts  that 
can  trap  a  part  of  rays  increasing  so  the  total  range  of  radio  communication. 

For  investigation  of  ray  propagation  in  modulated  atmospheric  ducts  a  special  Duct  Radio 
Wave  Propagation  (DRWP)  program  was  developed.  It  allows  to  receive  phase  portrait  of  ray 
propagation  in  ducts  and  determine  ray  divergence  and  phase  shift  of  radio  waves. 


2.  Ray  approximation. 

In  many  cases  the  radio  wave  propagation  in  ducts  can  be  described  using  approximation  of 
geometric  optics.  In  approach  of  geometric  optics  the  optical  length  of  path  for  valid  ray  must  be 
minimal.  This  means  that  the  variation  of  action  S  for  valid  path  must  be  equal  to  zero 


<55”  =  0 


(2.1) 
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Figure  1 .  Refractivity  profile 
of  surface  based  duct. 


Figure  3.  Refractivity  profile 
of  elevated  duct. 


Figure  2.  Refractivity  gradient 
of  surface  based  duct. 


Figure  4.  Refractivity  gradient 
of  elevated  duct. 


Action  S  can  be  expressed  in  the  form 


Rb) 


sf^- 


dt 


(2.2) 


where  t  —  ReO  -  parameter  along  the  path  {0-  angle  in  the  plane  of  great  circle),  Xi-  height  above 
the  earth,  P  and  Q  are  the  beginning  and  ending  points  of  the  ray  path,  Re  -  radius  of  the  Earth, 
n  -  radio  refractive  index,  X2  -  impulse,  X2  =  n  sin  a,  a  -  angle  of  ray  inclination  to  the  Earth. 
The  altitude  -  impulse  variables  (:ri,  X2)  satisfy  to  Hamilton  equations  and  describe  completely 
the  behavior  of  rays  in  ducts  [6].  For  ray  propagation  in  duct  with  slowly  changing  along  path 
refractive  index  profile  there  exists  the  ray  adiabatic  invariant 

I  —  j  X2dxi  =  const  (2.3) 

DRWP  program  solves  the  Hamilton  equations  that  define  the  evolution  of  variables  (xi,  X2), 
draw  ray  propagation  phase  portrait  and  calculate  phase  shift  and  space  coordinates  of  rays. 


3.  Radio  wave  propagation  in  atmospheric  ducts. 

Very  often  ray  tracing  and  predicting  programs  use  different  analytical  or  semi  empirical  models  of 
refractive  index.  As  input  these  models  can  use  several  (5-15)  points  of  refractive  index  obtained 
from  measurement.  For  short  distance  of  1-3  ray  hops  in  duct  this  gives  a  good  result  in  evaluation 
of  field  strength  and  ray  tracing.  For  larger  distances  it  is  necessary  to  use  more  detailed  structure, 
DRWP  program  allows  to  use  all  information  about  refractive  index  structure  along  the  ray  path 
without  any  restriction.  The  examples  of  two  used  for  calculation  refractive  index  profiles  are 
presented  in  Figures  1,3.  The  refractivity  profile  in  Figure  1  corresponds  to  the  surface  based 
duct  and  is  typical  for  region  in  vicinity  of  Pt.  Mugu,  CA  [4],  The  height  gradient  of  that  profile 
is  presented  in  Figure  2.  The  refractivity  profile  in  Figure  3  corresponds  to  the  elevated  duct  and  is 
typical  for  San  Diego  region  [1].  The  corresponding  height  gradient  of  profile  is  presented  in  Figure 

4.  The  numerical  value  of  height  gradient  of  refractive  index  profile  determines  the  directions  of 
ray  propagation  and  zones  where  rays  can  be  trapped  in  ducts.  If  the  vertical  gradient  of  radio 
refractive  index  is  negative  and  less  then  -157  N  units/km  the  rays  bend  downward  to  the  earth 
[5].  It  is  the  regions  where  ducting  is  possible. 

During  calculations  the  weak  periodical  perturbations  were  superimposed  on  background  re¬ 
fractive  index  profile.  The  resulting  variations  of  refractive  index  profile  and  its  gradients  are 
represented  in  Figures  1-4  as  thin  curves.  Results  of  calculations  for  different  conditions  are 
represented  as  phase  portraits  in  Figures  5-8. 

The  ray  propagation  phase  portrait  in  unperturbed  duct  for  refractivity  profile  shown  in  Figure 
3  is  represented  in  Figure  5.  The  height  gradient  of  refractivity  profile  (thick  curve  in  Figure  2) 
has  two  regions  where  the  numerical  value  of  gradient  is  less  then  -157  N  units/km.  In  accordance 
with  Figures  1,3  the  phase  portrait  in  Figure  5  has  two  regions  with  trapped  rays.  The  first  is 
surface  based  duct  in  which  the  rays  reflect  from  the  earth.  The  second  is  elevated  subduct.  The 
rays  propagate  in  it  not  touching  the  earth.  From  Figure  2  it  is  seen  that  gradient  of  background 
refractive  profile  has  two  regions  where  the  numerical  value  of  gradient  is  near  the  critical  value  of 
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Figure  5.  Phase  portrait  of  surface 
based  duct. 


Figure  6.  Subducts. 


Figure  7.  Phase  portrait  of  Figure  8,  Phase  portrait  of 

elevated  duct.  modulated  duct. 


-157  N  units/km.  In  the  absence  of  perturbation  no  duct  exists  at  these  heights  (Figure  5).  But 
during  modulation  of  refractive  index  the  gradient  can  become  slightly  less  then  -157  N  units/km 
(thin  curves  in  Figures  1,3).  This  leads  to  arising  of  additional  two  subducts  that  can  trap  rays. 
The  examples  of  rays  trapped  in  these  additional  elevated  subducts  are  shown  on  phase  portrait 
in  Figure  6. 

The  phase  portrait  of  ray  propagation  for  refractive  index  structure  shown  in  Figure  3  is 
represented  in  Figures  7,8,  Figure  7  corresponds  to  unperturbed  duct  while  Figure  8  corresponds 
to  periodically  modulated  duct.  In  the  absence  of  disturbances  the  total  phase  space  (Figure  7) 
consists  of  non  intersecting  separate  curves.  Each  curve  corresponds  to  the  separate  ray  path.  It 
is  seen  that  the  duct  has  complex  thin  structure.  One  part  of  rays  propagate  in  upper  system 
of  elevated  subducts.  Another  part  of  rays  propagate  in  elevated  duct  near  the  earth  and  can 
periodically  reflect  from  the  earth. 

Evolution  of  several  rays  in  modulated  duct  is  shown  in  Figure  8.  The  single  ray  in  modulated 
duct  can  fill  in  the  whole  area  in  the  phase  portrait.  These  are  regions  with  fast  ray  divergence. 
For  a  great  part  of  rays  it  is  possible  only  temporal  trapping  in  duct  because  of  fast  divergence. 
Comparison  of  phase  portrait  in  Figures  7,8  shows  that  the  upper  system  of  elevated  subducts  is 
destroyed  due  to  perturbation  of  refractive  index  profile. 

4.  Divergence  of  rays  in  periodically  modulated  ducts. 

The  velocity  of  ray  divergence  is  a  very  important  factor  to  be  taken  into  account  under  elaboration 
of  different  radio  communication  systems.  To  investigate  the  ray  divergence  in  ducts  the  paths  of 
several  close  rays  for  profile  shown  in  Figure  3  were  tracked.  An  example  of  ray  trace  in  periodically 
modulated  duct  is  represented  in  Figure  9.  A  part  of  the  same  ray  trace  in  unperturbed  duct  is 
shown  in  Figure  10.  In  unperturbed  duct  the  ray  performs  regular  sine  like  oscillations  (Figure 
10)  while  in  modulated  duct  the  form  of  oscillations  is  rather  complicated  (Figure  9). 

Phase  shift  of  radio  waves  in  duct  is  proportional  to  optical  length  of  their  ray  paths.  The 
absolute  values  of  refractive  index  variations  for  radio  waves  are  very  small.  For  all  rays  in  narrow 
beam  the  optical  length  of  path  is  almost  the  same.  Plots  of  optical  length  for  ray  in  narrow 
beam  as  in  modulated  and  unperturbed  ducts  lie  almost  on  the  same  straight  line  (Figure  12).  In 
unperturbed  duct  narrow  beam  remain  to  be  narrow  at  all  the  range.  In  modulated  duct  because 
of  the  rapid  divergence  of  rays  the  beam  widens  quickly  and  rays  fill  in  all  space  in  duct.  Straight 
line  in  Figure  12  illustrates  the  fact  that  the  ray  adiabatic  invariant  (2.3)  is  approximately  constant 
for  close  rays  in  duct  although  the  ray  traces  can  differ  significantly.  Radio  waves  propagating  in 
modulated  duct  have  almost  the  same  phase  shift  as  in  unperturbed  duct.  In  more  detail  scope 
there  exists  difference  in  optical  length  for  rays  in  perturbed  and  unperturbed  ducts.  In  relative 
units  the  difference  in  optical  length  for  close  rays  is  represented  in  Figure  11.  The  straight  lines 
4,5  in  Figure  11  correspond  to  two  initially  close  rays  in  unperturbed  duct.  The  difference  in 
optical  path  for  them  is  very  small.  These  rays  continue  to  be  close  during  all  the  range.  Curves 
1,2,3  in  Figure  11  correspond  to  rays  in  narrow  beam  that  propagate  in  modulated  duct.  They 
have  oscillating  character.  This  means  the  receiving  signal  will  have  additional  oscillations  too. 
The  difference  in  optical  length  for  rays  in  modulated  duct  is  far  more  than  for  rays  in  unperturbed 
duct.  In  modulated  duct  the  divergence  of  rays  is  almost  in  50  times  more  than  in  unperturbed 
duct.  At  long  distances  the  absolute  value  of  optical  length  for  rays  propagating  in  modulated 
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duct  is  less  than  in  unperturbed  duct.  It  is  connected  with  the  fact  that  in  modulated  duct  rays 
penetrate  in  the  regions  where  the  refractive  index  has  less  value  and  propagate  there  for  a  lot  of 
time.  Meantime  the  path  length  of  rays  in  modulated  duct  is  more  than  for  rays  in  unperturbed 
duct. 

In  spite  of  small  value  of  periodical  perturbations  of  atmospheric  ducts  they  can  change  sig¬ 
nificantly  the  propagation  of  radio  waves.  Ray  divergence  in  periodically  modulated  duct  can  be 
very  large.  At  distance  of  about  100  km  the  height  difference  between  two  initially  close  rays  can 
reach  full  height  thickness  of  duct.  This  makes  difficult  the  restoring  of  images  in  radar  observa¬ 
tion  systems.  Another  effect  is  connected  with  additional  phase  shift  of  radio  waves  in  modulated 
ducts.  For  close  rays  propagating  in  modulated  duct  the  phase  shift  can  reach  critical  value  at 
distance  of  20  km  for  frequency  1.4GHz.  After  this  distance  the  work  with  devices  based  on  phase 
methods  can  faced  with  problems.  In  unperturbed  ducts  the  critical  length  is  about  1000  km, 
that  is  enough  for  many  of  applications. 

5.  Summary. 

DRWP  program  was  elaborated  specially  to  investigate  radio  wave  propagation  in  atmospheric 
ducts.  The  results  of  calculations  are  represented  as  ray  propagation  phase  portraits.  The  phase 
portraits  give  explicit  picture  of  regions  were  ducting  is  possible  and  allow  to  estimate  and  predict 
radio  wave  propagation  in  atmospheric  ducts  under  different  real  natural  conditions.  DRWP  pro¬ 
gram  tracks  rays  in  modulated  duct  and  calculates  ray  divergence  and  phase  shift  of  radio  waves. 
It  allows  to  to  estimate  the  range  of  radio  communication  in  modulated  ducts  and  conditions 
under  which  the  ducting  is  possible. 
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Abstract 

An  approximate  description  of  the  plane  wave  diffraction  by  a  dielectric  wedge  with  small 
refractivity  and  absorption  is  found  using  the  parabolic  equation  method. 


Diffraction  by  dielectric  wedge  remains  to  be  one  of  the  most  intriguing  problems  of  the  linear 
wave  theory  due  to  its  deceiving  simplicity  and  numerous  technical  applications.  It  serves  as  a  key 
problem  for  any  diffractive  element  containing  grooves,  strips  or  abrupt  edges.  Novel  technologies 
(subsurface  radio  sounding, optical  microlithography.  X-ray  imaging  optics  etc.)  demand  a  very 
accurate  determination  of  the  wave  field  distribution  near  the  features  under  investigation.  It 
stimulates  the  search  of  new  analytical  and  numerical  solutions  of  this  classical  but  still  unsolved 
problem  [1]. 

Here,  we  consider  an  important  particular  case  of  a  rectangular  dielectric  wedge  illuminated  by  a 
plane  wave  impinging  its  face  normally,  with  zero  angle  of  incidence.  It  is  assumed  that  the  dielectric 
material  is  characterized  by  weak  refraction  and  absorption:  n  —  e  —  \  +  a.  -{■  ijd ,  \a,  II\  «  1  (it 
depends  on  its  structure  and  chemical  content  but  it  is  always  true  for  EM  waves  of  the  extreme  UV 
and  X-ray  spectral  band).  Under  these  assumptions,  one  can  separate  the  diffraction  phenomena 
occurring  at  both  wedge  faces;  the  incident  wave  enters  the  dielectric  medium  trough  its  front 
side  practically  unperturbed,  with  a  small  reflection  coefficient,  and  propagates  along' the  second 
interface  with  small  but  noticeable  absorption  and  with  phase  velocity  different  from  the  free-space 
light  velocity.  If  one  uses  the  geometric  optics  approximation,  a  gradually  growing  discontinuity 
arises  between  this  wave  and  its  free-space  counterpart  propagating  outside  the  dielectric  wedge. 
Diffraction  matches  these  two  waves,  and  this  effect  can  be  described  separately  from  the  wave 
phenomena  occurring  at  the  front  side.  This  simplifies  the  analytical  technique  and  allows  one  to 
obtain  an  exact  solution  of  our  simplified  problem. 

Let  the  dielectric  material  fill  the  quadrant  0  <  ^  <  1:12  oi  the  cross-section  plane  2:  = 
rcos<p,  y  =  rsimp.  Under  the  condition  \s  ~  1|  <<  1,  we  can  use  the  paraxial  Ansatz 

Ex{z,  y)  =  u[z,  y)  exp(iA;a  -  iwt)  (1) 
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for  a  wave  packet  propagating  along  z-axis  which  leads  to  a  pair  of  parabolic  equations 

2a|f  +  0  +  «:^(e-  1)«  =  0,  too 
,  J/  <  0 


$y2 

2ik^ 


_ L  £b1 

dz  ^ 


(2) 


describing  the  wave  field  on  both  sides  of  the  interface  7/  =  0,  >  0  [2].  The  reflected  wave  for 

2  <  0  is  determined  by  the  small  Fresnel  reflection  coefficient  /?  =  (!-  Ve)/[l  +  Ve)  «  1  and 
can  be  neglected.  The  transmission  coefficient  T  =  2/(1  +  Ve)  ss  1,  consequently,  to  the  first 
approximation,  the  initial  condition  can  be  written  as 


u(0,y)  =  l,  y  >  0 
^io(0,y)  =  1,  y  <  0 


(3) 


(4) 


The  matching  conditions  at  the  interface  y  =  0  (continuity  of  and  H^)  resulting  in 

u{z,  0)  =  uo{a,0) 

and  some  radiation  condition  excluding  waves  coming  from  y  =  ±00  must  be  added  to  complete 
the  boundary  value  problem  formulation  [3]. 

An  exact  solution  of  Eqs.(2)-(4)  can  be  found  in  the  form  of  Laplace  transform: 

^  c+too 

w(2,y)  =  ^  / 


(5) 


Substitution  into  (2)  yields  a  pair  of  ordinary  differential  equations 


d^u 

9j/2 


+  [2ikp  +  -  1)]U —  2iku{Q,y),  y>0 

^  +  2f^p[/o  =  2f^uo(0,y)  ,  y<0 


(6) 


Their  solutions  satisfying  the  matching  conditions  (4)  and  bounded  at  infinity  are 


f/o(P,!/)  =  f  [(?-  1)  + 


with  q  =  \j2ikp  +  P(£  ■  1),  yo  =  \/2ikp. 

Substitution  into  the  Laplace  integral  (5),  after  deformation  of  the  integration  path  and  the 
change  of  variables  p  =  I z,  yields 


oo--ic  /  ^  \ 

r  -oo-tc  ^  ^ 


(8) 


Here,  the  dimensionless  parameters  rj  =  \y\sjk/2z  and  7  =  \Jkz[£  -  \)l2i  «  ^kz{0  -  ia)  are 
introduced.  This  expression  can  be  decomposed  into  the  sum 


uo{z,y)  =  $(y)  +  ’I'(y,7) 


(9) 
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of  the  Fresnel  integral 

oo-tc  A4  0  ^ 

=  j  exp(-t2-2t;7e‘"/")  j  =  (10) 

-co-tc  0 

and  another  special  function 

—  00  — :c 

The  first  term  ^{rj)  is  generic  for  all  the  edge  diffraction  problems.  The  second  term  ^'(7?,7) 
depends  on  the  material  properties  a  and  ,5  modifying  the  simple  Fresnel  diffraction  pattern  in  the 
penumbra  region.  At  the  interface  y  =  0,  it  can  be  reduced  to  the  modified  Bessel  function; 


Quite  similarly,  in  the  upper  quadrant  a  >  0,  j/  >  0  (inside  the  dielectric  wedge),  the  solution 
(5).  (7)  can  be  expressed  in  terms  of  the  Fresnel  integral  and  the  ^-function; 

u{z,y)=^Je’’^U{p,y)dp  = 
r 

[$(??)  + ^(77,  zp)]  (13) 

In  virtue  of  (12),  formulae  (8)  and  (13)  coincide  at  the  interface  u  =  0.  Taken  together,  they  provide 
continuous  matching  of  two  plane  waves  propagating,  accordingly,  in  free  space  and  the  dielectric 
body  with  different  phase  velocities  and  absorption.  They  describe  a  joint  effect  of  transversal 
diffusion  [3],  edge  diffraction  and  surface  refraction  forming  a  rather  complicated  field  distribution 
near  the  interface  z/  =  0. 

Asymptotic  evaluation  of  the  integrals  (8)  and  (13)  yields  approximate  formulae  consistent  with 
geometrical  theory  of  diffraction  [4].  The  off-integral  terms  correspond  to  the  unperturbed  plane 
waves  exp(zfcz)  and  exp(zfcz  -  7^)  =  exp[z^:2(l  +  ct  +  z/?)]  =  exp{iknz)  inside  in  free  space  and  the 
dielectric  wedge. 


1  + 


-  1 


Generally,  the  main  contribution  into  the  integrals  (8)  and  (13)  is  given  by  the  saddle  point 
t  =  The  corresponding  exponential  exp{ir)^)  =  exp{iky'^ f2z)  after  multiplication  by  the 

rapidly  oscillating  factor  exp{ikz),  yields  the  edge  wave  with  the  approximate  eikonal 


2  +  y^l2z  K.  ■'J ^ 
z{l  +  a  -  i(3)  +  z/^/22:  sa  zzV -f.  y2_ 


y  <  0 
y  >  0. 


(14) 
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By  deformation  of  the  integration  path  to  the  steepest  descent  path  Imt  =  one  of  the 

branch  points  i  =  ±7  in  (8)  and  t  =  i'y  in  (13)  may  be  crossed.  Integration  around  the  cut  between 
the  branch  points  gives  a  contribution  corresponding  to  the  ’’head  wave”  propagating  along  the 
wedge  side  y  =  0  in  the  medium  with  maximum  phase  velocity  (vacuum  or  dielectric,  depending 
on  the  sign  of  a)  and  leaving  the  interface  at  the  angle  of  total  reflection.  The  most  pronounced 
effect  is  observed  i^he  absorption  is  small:  0  «  a.  In  this  case,  the  head  wave  exists  in  the 
sector  0  <  y  <  inside  the  dielectric  wedge  for  a  >  0  (this  is  typical  for  radio-  and  optical 

frequences)  and  in  the  free-space  sector  -zV -2a  <  y  <  0  for  a  <  0  (in  the  extreme  UV  and  X-ray 
spectral  region).  The  corresponding  eikonals  are  «  z  +  y^ 2a  and  fs;  2(1  + a)  -  y-y-2a. 
Interference  of  the  head  wave,  within  the  domain  of  its  existence,  with  the  unperturbed  plane  wave 
and  the  edge  wave  results  in  a  rather  complicated  field  distribution  with  maxima  higher  than  in 
the  case  of  Fresnel  knife-edge  diffraction.  This  effect  has  been  revealed  in  numerical  modeling  of 
thick  Fresnel  zone  plates  [5]. 

This  work  has  been  supported  in  part  by  the  Russian  Basic  Research  Foundation,  Grant  No 
95-05-14027. 
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Introduction 

Fresnel  zone  plates,  being  the  main  elements  of  X-ray  and  EUV  transmission  imaging  optics,  are 
very  fine  and  expensive  objects.  They  require  accurate  design  and  high  precision  manufacturing  to 
achieve  desirable  performance.  For  testing  their  quality  and  final  alignment  of  the  optical  system, 
special  methods  and  experimental  facilities  are  being  developed.  The  straightforward  measurement 
of  the  wave  field  near  the  focal  plane  is  difficult  due  to  microscopic  size  of  the  zone  plate,  small 
focal  length  (about  1  mm)  and  submicron  spatial  resolution.  Instead,  information  about  the  wave 
front  structure  and  aberrations  is  obtained  from  the  far  field  {z  ~  10cm)  measurements  [ij.This 
introduces  new  problems  as  the  relation  between  the  far  and  near  fields  is  not  described  by  simple 
formulae  of  geometric  optics  and  requires  accurate  diffraction  calculations. 

The  objective  of  this  paper  is  to  simulate  and  theoretically  investigate  the  far-held  intensity 
and  phase  distribution  produced  by  realistic  EUV  Fresnel  zone  plates  and  their  idealized  models. 
Such  theory  is  necessary  for  adequate  interpretation  of  the  observed  diffraction  patterns  and  helps 
one  to  distinguish  between  the  fundamental  diffraction  effects  and  the  aberrations  introduced  by 
manufacturing  and  alignment  errors. 

LNumerical  modeling  of  the  far-field  intensity  distribution 
in  EUV  interferometry  experiments 

We  used  L-3  computer  code  developed  at  Moscow  X-ray  Optics  Group  to  simulate  the  far- 
held  EUV  light  intensity  distribution  observed  in  the  point  diffraction  interferometry  experiments 
[1].  The  mathematical  basis  of  the  L-3  code  is  described  in  [2j.  Here  we  present  some  results  of 
straightforward  held  calculation  for  a  realistic  Fresnel  zone  plate  (ZP)  with  the  parameters  close 
to  [1],  for  an  idealized  thin  Fresnel  ZP  and  for  a  model  perfect  lens  of  the  same  optical  parameters. 
Our  main  goal  was  to  understand  the  nature  of  the  intensity  modulation  of  the  far-held  radiation 
patterns  produced  by  the  zone  plates  tested  in  [1].  Two  hypotheses  has  been  considered: 

1.  Two-scale  quasiperiodic  intensity  modulation  arises  in  the  near  held  of  the  ZP  due  to 
resonance  effects  inside  it  [3]  and  is  transferred  into  the  far  zone  according  to  the  laws  of  geometric 
optics; 

2.  This  intensity  modulation  is  caused  by  Fresnel  diffraction  of  the  ZP  hrst-order  spherical 
wave  by  the  order  sorting  diaphragms. 

We  also  had  in  mind  that  the  observed  effects  might  be  caused  by  random  manufacturing  errors 
of  zone  positioning. 

In  order  to  verify  these  hypotheses  we  reproduced  numerically  the  experimental  setup:  a  thick 


142 


zone  plate  of  external  radius  =  96/:im  composed  of  N  =  640  alternative  opaque  zones  and 
vacuum  spacings  is  illuminated  by  EUV  plane  wave  radiation  with  A  =  13.5nm.  The  refraction 
and  absorption  constants  of  the  material  (Ni)  are  S  =  0.138,  /?  =  0.031  at  this  wavelength.  The 
zone  plate  thickness  is  t  =  200  nm,  and  the  outer  zone  width  is  drjv  =  75  nm.  The  diaphragms  used 
are:  central  stop  (CS)  of  radius  r«  =  and  order  sorting  aperture  (OSA)  of  rosA  =  24.3  pm 

placed  at  zqsa  =  0.8  mm  from  the  ZP  (the  focal  distance  at  this  wavelength  is  /  =  1.1  mm). 

The  global  diffraction  field  distribution  has  been  calculated  via  numerical  integration  of  the 
parabolic  wave  equation  [2]  beginning  from  the  front  side  of  the  zone  plate  up  to  the  range  ^  = 
1.8  mm  (0.7  mm  behind  the  focus)  where  a  steady  far-field  radiation  pattern  is  already  formed.  In 
some  calculations,  an  additional  diaphragm  in  the  focal  plane  was  introduced  in  order  to  simulate 
the  effect  of  knife-edge  diffraction. 

Consider  the  numerical  results.  In  Fig. la,  radial  amplitude  and  phase  distribution  over  the 
back  side  of  the  zone  plate  is  plotted  (its  fragment  centered  at  the  CS  edge  is  shown  in  Fig.lb). 
After  removing  the  sharp  oscillations  due  to  edge  diffraction  at  the  interfaces  [3],  it  reproduces 
with  enough  accuracy  the  ZP  transmission  function  calculated  in  Kirz’s  approximation  [4] 


Tir)  = 


exp{-ik6  -  k^)t  for  <r  <  r2n-i, 

1  for  r2n-i  <  r  <  r2n-,  n  —  A'o, . . . ,  A 


-see  Fig.2a,b.  This  calculation  proves  that  in  our  case  there  is  no  pronounced  intensity  modulation 
at  the  ZP  output  and  the  first  hypothesis  must  be  rejected. 

The  next  pictures  (see  Fig. 3)  show  the  far-field  amplitude  distribution  across  the  illuminated 
ring  r  ~  30  -  60  pm  at  the  distance  2;  =  1.8  mm  from  the  ZP.  Here,  two  scales  of  radial  amplitude 
variation  are  clearly  seen:  a  very  fast  oscillations  looking  as  a  projection  of  the  ZP  transmission 
function  and  a  broader  modulation  resembling  classical  Fresnel  diffraction  patterns.  Their  super¬ 
position  produces  a  complex  chaotic  quasiperiodic  pattern  with  up  to  30%  amplitude  modulation 
consistent  with  the  experimental  results  [1].  It  must  be  stressed  that  no  zone  positioning  error 
had  been  introduced  into  the  computational  model.  Even  the  idealized  infinitely  thin  Fresnel  zone 
plate  displays  the  same  far-field  radiation  pattern  (with  a  smaller  peak  amplitude  due  to  different 
diffraction  efficiency)  -  see  Fig. 4. 

So  the  two-scale  far  field  intensity  modulation  observed  in  [1]  is  an  intrinsic  feature  of  the  ZP 
diffraction  field  though  the  nature  of  these  two  scales  is  apparently  different. 

In  order  to  separate  these  effects  we  performed  calculation  of  the  wave  field  produced  by  a 
model  perfect  lens  of  the  same  diameter  and  focal  length.  The  same  optical  scheme  (CS  +  OS.A.) 
has  been  modeled.  Fig. 5  shows  the  far-field  radial  distribution  over  the  illuminated  region  at  the 
range  z  =  1.8  mm. 

We  see  that  the  short-period  oscillations  have  disappeared  while  the  slower  modulation  remains 
(some  high-frequency  ’’noise”  is  present  in  the  outer  part  of  the  illuminated  ring  which  is  due 
to  knife-edge  diffraction  caused  by  a  diaphragm  put  in  the  focal  plane).  Generally,  large-scale 
features  of  the  far  field  amplitude  distribution  for  the  ZP  and  for  its  equivalent  lens  are  similar 
(number  of  peaks  about  20,  modulation  depth  about  20%  in  amplitude  or  40%  in  intensity)  The 
main  difference  is  less  regular  but  more  stochastically  uniform  distribution  of  the  peak  amplitudes 
in  the  former  case.  Nevertheless,  the  lens  model  may  be  quite  useful  for  a  rough  prediction  of  the 
ZP  far  field  structure.  Below,  in  Section  2,  we  derive  a  simple  analytical  formula  describing  this 
effect  as  a  result  of  edge  diffraction  by  the  central  stop  and  OSA. 
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2.  Diffraction  of  convergent  spherical  wave  by  annular  aperture 

Let  ({p)  ~  f  -  V P  ~  p'^  be  a  convergent  spherical  wave  front  produced  by  a  lens  of  focal  length 
/  -  see  Fig. 6.  Suppose  that  it  is  partly  blocked  by  a  diaphragm  and  a  central  stop  of  corresponding 
angular  radii  ^  —  B  and  -  A.  We  are  interested  in  spatial  distribution  of  light  intensity  in  the 
far  zone  at  ranges  z  >  f. 

Following  Debye’s  classical  analysis  -  see  [5]  we  use  the  Kirchhoff  diffraction  integral 


over  the  open  portion  of  the  initial  wave  front.  Here,  R[P^  Q)  is  the  distance  between  the  observa¬ 
tion  and  integration  points  P[z,r,<p>)  and  QiCiPi'^)^  A  =  27r/^  is  the  wavelength. 

In  polar  coordinates  centered  at  the  focal  point  z  =  f: 

r  =  hsind,  z  =  f  +  hcosO 
p  —  fsinP,  (  =  /(I  -  cosfS) 


the  optical  path  R  takes  the  form 

R  =  ^{rcosip-  pcos^l^y  -t-  {rsimp  -  psinipy  +  (^  -  C)^  == 

=  ■^h'^  +  P  +  2hf[cos0cosl3  -  sinOsindcos{ip  -  z/')] 

In  our  case,  A  ~  10  nm,  /  ~  10®  nm  A,  Unlike  [5],  we  do  not  consider  the  observation  point  P 
being  close  to  the  focus.  On  the  contrary,  we  suppose  h  ~  f  whereas  the  polar  angles  /5,  9  are  small 
due  to  limited  numerical  aperture  A,  B  <  0.1.  Neglecting  the  terms  of  the  order  of  magnitude 
0{,3^  +  yields 

+  +  +  (3) 

2{h  +  f) 

We  denote: 

^  ^  +  tt  +  7,  cos{fp  -  ■0)  =  -  cosy 

To  this  approximation, 

E{P)  =  F{h,e)  (4) 

h 

where  the  radiation  pattern 

IJ  ^  i  2  Jr  ^22 

F{h,  ^  J  ^  J  d^  (5) 

A  -TT  A 


As  p  =  kH  ~  5  X  10®,  l3,  9  ^  0.1,  the  argument.  p9l3  —  10®  is  large,  and  the  Bessel  function  can 
be  replaced  by  its  asymptotics 


Jo(0  ~ 
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We  obtain  approximately 


V2i^p6  X  L_ 

(B+0)Vf  , 

J 

(A+0)-\/  I 


= 


, _  {B-OWl 

{A-eWl 


A  +  i^/f 


Integration  by  parts  shows  that  the  first  integral  in  the  brackets  is  of  the  order  O  (^).  and  the 
second  one  can  be  rednced,  in  the  first  approximation,  to  the  ordinary  Fresnel  mtegral.  So  we  get 


F{K6)^  -- 


Conclusion 

The  verv  good  agreement  between  theoretical  and  1 f 

have  a  sdflnsistenl  analytical  and  numerical  description  of  the  diffractron-hm.ted  EUV  opt.cal 
Ltems  and  allows  us  to  make  a  reliable  statement  on  the  nature  of  the  interference  fringes 
in  the  far  field  of  the  zone  plates  tested  in  [1):  they  arise  independently  frorn  the  manufactu  g 
accuracy  as  a  result  of  the  edge  diffraction  by  the  central  stop  and  ** 

An  equivalent  perfect  lens  placed  into  this  envimnment  would  produce  ^ 

The  zone  plates  give  more  indented  intensity  distributions  due  to  projec  i 

structure  into  the  far  zone.  The  individual  peculiarities  are  probably  dependent  mostly  on  the  ed„e 
features  ot  the  zone  plate  at  test. 
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All  current  radar  target  identification  (RTID)  methods  based  on  high  range-resolution  radar  require  a  target 
library.  The  target  library  stores  the  encoded  target  specific  scattering  behavior  of  each  target  to  be  identified.  Because  of 
the  expense  and  difficulty  of  making  radar  measurements,  calculated  returns  are  almost  always  used  to  construct  target 
libraries. 

Currently,  target  libraries  are  constructed  from  antenna-independent  radar  profiles  using  calculated  returns.  The 
antenna-independent  radar  profile  of  a  target  is  the  time-domain  radio  frequency  (RIO  envelope  of  a  specified  component 
of  the  scattered  electric  field  which  results  from  a  linearly  polarized  transmitted  wave  with  a  specified  electric  field 
direction.  For  the  antenna-independent  radar  profile,  the  electric  field  components  of  both  the  transmitted  and  scattered 
electric  fields  are  specified  in  the  target’s  coordinate  ^stem.  For  example,  for  a  target  ID  system  where  the  radar  antenna 
is  linearly  polarized  with  vertical  polarization  (with  respect  to  the  antenna’s  coordinate  system),  the  antenna-independent 
radar  profile  is  calculated  from  the  vertical  (with  respect  to  the  target’s  coordinate  ^stem)  component  of  the  scattered 
electric  field  due  to  a  linearly  polarized  transmitted  wave  with  a  vertically  (with  respect  to  the  target’s  coordinate  system) 
directed  electric  field. 

Use  of  the  target’s  coordinate  system  is  convenient  because  it  avoids  adding  another  dimension  to  the  target 
libraiy  storage.  Unfortunately,  this  results  in  an  inadequate  description  of  the  target’s  scattering  characteristics  since,  in 
the  example  above,  it  assumes  that  the  vertical  direction  with  respect  to  the  antenna  coordinates  is  the  same  as  the  vertical 
direction  with  respect  to  the  target  coordinates.  Although  the  two  vertical  unit  vectors  lie  in  the  same  plane,  it  can  be 
demonstrated  that  in  general  they  are  rotated  by  a  constant  angle  which  depends  on  the  relative  orientation  of  the 
antenna’s  coordinate  system  with  respect  to  the  target’s  coordinate  system. 

In  this  presentation,  we  examine  the  impact  on  radar  target  identification  systems  of  using  antenna-independent 
radar  profiles  to  generate  the  target  library.  This  examination  is  accomplished  by  comparing  anteima-independent  radar 
profiles  with  antenna-dependent  radar  profiles.  The  antenna-dependent  radar  profile  of  a  target  is  the  time-domain  radio 
frequency  envelope  of  the  signal  traveling  out  of  the  receiver  antenna  which  results  from  a  band-pass  limited  signal 
traveling  into  the  transmitter  antenna.  Antenna-dependent  radar  profiles  were  developed  for  this  work.  The  method  used 
to  generate  the  antenna-dependent  radar  profile  precisely  accounts  for  the  frequency-dependent  polarization  of  the 
anteima,  the  vector  nature  of  the  transmitted  electromagnetic  field,  the  vector  nature  of  the  electromagnetic  field  scattered 
from  the  target,  as  well  as  the  exact  geometrical  relationship  between  the  antenna  and  the  target. 

This  presentation  begins  with  development  of  the  bistatic  radar  equation  in  a  form  which  is  explicitly  in  terms  of 
parameters  which  fix  the  geometrical  relationship  between  the  antennas  and  the  target,  and  in  terms  of  the  measured  or 
calculated  antenna  and  target  responses.  The  radar  equation  is  developed  in  both  the  time  and  frequency  domains.  The 
antenna-dependent  radar  profile  and  antenna-independent  radar  profile  are  then  defined  and  discussed.  The  presentation 
ends  with  computed  results  for  one  and  two  vector-component  wire  antennas  and  for  two  aircraft  targets.  The  conclusions 
return  to  a  discussion  of  the  target  libraiy  and  suggest  several  possible  remedies  to  the  current  methods  of  determining  the 
target  library  for  RTID  applications. 


‘  This  work  was  partially  supported  by  the  MICOM  Research,  Development,  and  Engineering  Center  (Dr.  C.  Ray  Smith) 
under  the  auspices  of  the  US.  Army  Research  Office  Scientific  Services  Program  administered  by  Battelle  (Task  Control 
Number  94265,  Delivery  Order  Number  1313,  Contract  Number  DAAL03-91-C-0034). 
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1.  INTRODUCTION 

In  this  paper  we  consider  the  processing  of  measured  scattered  field  data  over  a  wideband  of  frequencies.  The 
scattering  from  any  target  illuminated  by  a  plane  wave  consists  of  contributions  from  several  different  scattering  centos, 
each  of  which  may  represent  a  different  scattering  mechanism.  These  scattering  mechanisms  may  be  specular  reflections, 
edge  diffractions,  aeeping  waves  or  multi-mode  resonances  associated  with  a  cavity.  Signals  from  all  of  these  diverse 
mechanisms  are  buried  in  the  measured  composite  signal.  In  addition,  error  sources  such  as  scattering  from  support 
structures  are  also  embedded  in  the  measured  data.  It  is  of  interest  to  delineate  these  scattering  centers  even  when  they  are 
close  together  and  further,  to  determine  their  frequency  dispersion  characteristics  resulting  in  a  one-dimensional  radar 
range  profile  of  the  target.  Such  a  decomposition  is  of  great  value  in  the  context  of  RCS  measurements  in  a  range.  It 
becomes  possible  to  edit  the  range  profile  in  the  frequency  domain  to  eliminate  the  spurious  scattering  centers  and  the 
mechanisms  associated  with  the  support  structures.  As  a  result,  the  true  scattered  signal  may  be  obtained  from  the 
measured  signal.  Such  editing  in  the  frequency  domain  is  superior  to  the  more  conventional  editing  carried  out  in  the 
time  domain.  Editing  in  the  time  domain  eliminates  the  skirts  of  the  responses  from  adjacent  scattering  centers.  Alro,  the 
frequency  dispersion  characteristics  determined  here  may  serve  as  distinct  “signatures”  facilitating  the  identification  of 
the  target. 

The  technique  used  here  consists  of  two  steps.  In  the  first  st^,  the  signal  model  assumes  the  scattenng  from  vanous 
scattering  centers  to  be  frequency  invariant.  This  model  is  then  used  to  determine  the  time  delays  associated  with  each  of 
the  scattering  centers.  The  MUSIC  [1]  algorithm  is  used  for  this  purpose.  As  a  result,  the  resolution  obtained  is  beyond 
that  possible  with  Fourier  based  techniques.  In  the  second  step,  the  time  delays  computed  in  the  first  stqj  are 
incorporated  into  the  signal  model.  In  addition,  the  frequency  dispersion  characteristic  associated  with  each  of  the 
scattering  centers  is  parameterized  and  is  no  longer  considered  frequency  invariant.  These  parameters  are  then 
determined  using  a  least  squares  approach.  This  technique  is  applied  to  a  variety  of  synthetic  and  measured  data  sets  to 
assess  its  strengths  and  limitations.  It  is  shown  that  the  method  works  very  well  and  that  it  is  j^ble  to  extract  any 
arbitrary  dispersion  characteristic.  Using  measured  scattering  data  from  an  ogive  and  sphere,  editing  of  a  radar  range 
profile  in  the  frequency  domain  is  demonstrated. 

2.  THEORY 

We  consider  here  a  target  illuminated  by  a  plane  wave  resulting  in  the  excitation  of  L  scattering  centCTs  on  the 
target.  Let  the  back  scattered  field  be  measured  over  a  band  of  frequencies  from  fi  to  Ik  at  equal  frequency  increments 
denoted  by  Af  .  The  received  signal  at  any  frequency,  f,  may  be  modeled  as, 

v(f)  =  £ak(f)e“j'’*'‘  +n(f)  2.1 

k=l 
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Here,  Rj.  (f ) ,  which  may  be  complex,  represents  the  dispersive  scattering  amplitude  and  is  the  round-trip  time  delay 
corresponding  to  the  k*  scattering  center.  The  measurement  noise  and  modeling  error  is  represented  by  n(f)  .  Our 
objective  here  is  to  estimate  the  time  delays,  tk,  k=l,  2,  ...  L,  and  the  corresponding  dispersion  characteristics, 
(f),  k  =  1,2 ...  L,  from  the  measured  data,  Vj  =  V  (f^),  i=l,2...N.  This  is  accomplished  in  two  steps.  In  the 
first  step,  the  signal  model  assumes  the  scattering  from  various  scattering  centers  to  be  invariant  with  frequency.  This 
model  is  then  used  to  determine  the  time  delays  associated  with  each  of  the  scattering  centers.  Here,  we  have  used  the 
MUSIC  algorithm.  This  results  in  super  resolution,  making  it  possible  to  separate  scattering  centers  that  are  very  close  to 
each  other  in  range.  It  may  be  mentioned  that  any  other  super  resolution  technique  could  be  used.  In  the  second  step,  the 
time  delays  computed  in  the  first  step  are  incorporated  into  the  signal  model.  In  addition,  the  frequency  dispersion 
characteristic  associated  with  each  of  the  scattering  centers,  a^(f),is  parameterized  and  is  no  longer  considered 
frequency  invariant.  These  parameters  are  then  determined  using  a  least  squares  approach. 

2.1  Step-1:  Time  Delay  Estimation 

Here  we  take  the  (f )  to  be  frequency  invariant,  i.e.,  a^ .  The  signal  model  of  equation  2.1  then  becomes, 

i=  1,2...N  2.2 

l£=l 


Equation  2.2  may  be  expressed  more  compacdy  as, 

v=Wa-hn,  2.3 

where,  v=[Vi,  v^, 

n=[iii,  n2,...nfj}^ 

W  =  [Wi,W2,...,Wl] 

]^ ,  k  =  1,2, . . .  N 

and  a  =  [ai,  Ej, 

The  superscript,  T,  indicates  transposition.  Under  the  assumption  that  n  is  white  with  variance  and  uncorrelated 
with  the  scanered  fields,  the  correlation  matrix,  R,  is  given  by. 


R,  =  E[ V  v”  ]  =  W  R j  W“  +0^  I,  2.4 

where,  Rj=E[ a  a”]  and  I  is  the  identity  matrix.  The  superscript,  H,  indicates  conjugation  and  transposition.  Since 
the  vector  of  scattering  amplitudes,  a ,  is  deterministic  and  not  random, 

Rj=E[aa”]=aa“  2.5 

Rj,  therefore,  is  a  LxL  matrix  of  rank-1  and  is  singular.  As  a  result,  the  MUSIC  algorithm  cannot  be  applied 
directly  without  further  modifications,  even  though,  equation  2.4  has  the  appropriate  form.  Shan  et.  al.  [2]  have  devised 
a  scheme  using  spatial  smoothing  to  decorrelate  a  coherent  signal  environment  in  the  context  of  angle  of  amval 
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estimation.  A  similar  “frequency  smoothing”  scheme  may  be  employed  here  and  is  discussed  in  [3].  This  procedure 
consists  of  the  following  steps: 

a)  A  subaperture  of  size  N'  is  chosen.  Then,  from  the  signal  vector  v,  M  =(N  —  N^+1)  different  signal  vectws, 
,  k  =  1,2, . . .  M,  each  of  length  N',  may  be  chosen.  The  signal  vector  correspcmding  to  the  k*  subaperture  is 

obtained  from  rows  kto  (N'  +  k  — 1)  of  vand  is  defined  by, 

=  v(k:N'+k-l) 

k=l 

=  2.6 

where  W'=[W;,W2' ,  .  .  .  W^] 

W'  g-j25tf2'k 

’  *^k+l  >  •  ’  •  ^N'+k-J 

D  =  diagCe"^^”^' 

The  correlation  matrix  corresponding  to  each  is  given  by, 

R,,  =E[v'(v'f] 

=  W'  Rj  W'“  +  <5^1  2.7 

=  W'R'^  W'”  +a"l 

b)  The  averaged  correlation  matrix  of  size  N'  x  N"  is  defined  by, 

R=J_y  w'R'  w  ^'+o^i 

Mti 

=  W'R'W'  +  a^I  2.8 

where,  r;=^|;r;,, 

M  S 

It  has  been  shown  in  [2]  and  [3],  that  R 1  is  not  singular  and  that  the  larger  the  number  of  “subapertures”,  M,  the  greater 
the  degree  of  decorrelation  achieved,  though,  attained  at  the  expense  of  reduced  size  of  the  correlation  matrix.  Equation 
2.8,  now,  is  in  a  form  suitable  for  analysis  using  the  MUSIC  [1]  algorithm. 

The  space  defined  by  the  eigenvectors  corresponding  to  the  lowest  (N^  — L)  eigenvalues  of  Rjis  the  ndse 
subspace,  Qn.  That  is, 

QN=[qL.iqLf2-qN']  2.9 

where  are  the  eigenvectors  of  R  j .  The  MUSIC  spectrum,  then,  is  defined  by. 
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2.10 


^MUSIC  (^d)  “  SH 


s^QnQSs 


where  the  scanning  vector  S  is  given  by, 

S  =  g- ^  _  e-j2rfN-td  ]  2.1 1 

The  peaks  of  this  spectrum  correspond  to  estimates  of  the  round  trip  delays  associated  with  the  L  scattering  centers. 

2.2  Step-2:  Dispersion  characteristics  of  scattering  centers 

We  now  consider  the  procedure  for  determining  the  scattering  amplitude  response  at  the  p'’*  frequency 
Et  (fp  ),  k  =  1,2,-  •  -L.  Towards  this  end  we  consider  a  band  of  frequencies  of  width  and  centered  at  fp .  Let  there 

be  (N  +  1)  samples  of  data  over  this  band.  The  signal  model  corresponding  to  this  band  of  frequencies  is  given  by, 

=  l=p-|-.-.p+^  2.12 

k=l  ^  ^ 

Note  that  the  scattering  amplitudes  are  not  constant  as  in  equation  2.2  and  the  round  trip  time  delays  of  equation  2.1  are 
deemed  to  be  known  and  replaced  by  their  estimates,  .  We  assume  a  functional  form  for  the  k*  scattering  amplitude 
function  a^.  (f ).  These  functional  forms  may  be  different  for  different  k,  but  here,  we  consider  a  polynomial  of  order  q , 
as  given  by. 


Ej,  (f  )=ato+at,f +at2f  ^+. . .+  ^  2.13 

The  problem  of  obtaining  the  dispersion  characteristics  may  now  be  formulated  as  the  determination  of  the  parameters 
Ejj,  k  2=  1,2,*-*L  and  l=0,l,2***q,that  results  in  the  best  match  between  the  measured  data  and  the  signal  model 

in  equation  2.13  in  the  least  squares  sense.  The  measurement  vector,  v  =  [Vq  Vj  . ..  Vjj]^  may  now  be  expressed  as, 

v  =  W^a  +  £  2.14 

where,  W  =  [Wo  Wj ,..%] 

e  =  [eo,ei,...£Nf 

w:=[wiw:,...wi] 

] ,  j  =  1,2,. . .  L 

a  =  [aio  ...aiq,a2o...a2q,...,aLo...aLq]^ 

The  parameter  vector  a  may  now  be  determined  by  requiring  the  error  power  J  =  ||e|p  to  be  the  least  possible.  This 
optimum  vector  may  be  shown  to  be  given  by  [4], 

a=(W*W"'r^W*v  2.15 
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Once  this  parameter  vector  is  defined,  the  dispersion  characteristics  evaluated  at  a,j(fp),  k  — are  readily 
determined.  The  firequency  then  is  incremented  fi’om  fp  to  (fp  +  Af )  and  the  procedure  is  repeated  to  determine  the 
firequency  response  at  this  new  fi'equency.  The  firequency  response  over  the  entire  band  may  be  determined  in  this 

fashion,  except  for  small  bands  of  width  ~  at  the  edges  of  the  band.  Consideration  for  proper  choice  of  B,  and  other 
parameters  are  discussed  in  the  next  section. 

3.  RESULTS 

The  theory  described  in  Section-2  is  used  to  process  a  variety  of  synthetic,  experimentally  obtained  and 
theoretically  predicted  scattered  fields.  The  results  obtained  depend  upon  the  choices  made  for  the  regression  bandwidth, 
,  the  firequency  increment,  Af  ,  and  the  number  of  subapertures,  M ,  used  to  decorrelate  R  j .  The  condition  number 

of  (W*W^)  depends  upon  B^  .  The  condition  number  is  «  1.0  if  B^  is  too  small  and  equation  2.15  becomes  ill- 
conditioned  resulting  in  degraded  estimates  of  scattering  center  frequency  responses.  The  regression  bandwidth 
corresponding  to  the  first  maximum  in  the  plot  of  B^  vs.  condition  number  of  (W  W  )  is  the  optimal  choice.  The 
choice  of  Af  determines  the  rate  of  sampling  of  the  scattered  fields  in  the  frequency  domain  and  as  such  dictates  the 
range  of  time  delays  that  are  not  aliased.  Af  must  be  chosen  so  that  the  expected  range  of  time  delays  corresponding  to 

the  scattering  centers  on  the  target  lie  between  —  and  .  Our  simulations  indicate  that  choosing  M,  the 

number  of  sub  apertures,  to  be  greater  than  6,  sufficiently  decorrelates  R  j . 


Illustration- 1:  Synthetic  Data: 

For  purposes  of  illustration,  we  generate  scattered  fields  by  prescribing  three  scattering  centers  vrith  specified 
round  trip  delays  and  corresponding  frequency  responses  (SNR=20  dB  and  M=16). 

The  first  scattering  center  is  at  -1  ns  and  its  frequency  response  is  linearly  increasing.  The  second  scattering 
center  is  resonant  with  center  frequency  at  10  GHz  and  is  located  at  0  ns.  The  third  scattering  center  is  located  at  1  ns 
and  its  frequency  response  is  linearly  decreasing.  Figure  1  shows  the  MUSIC  and  Fourier  spectra  overlaid  using  the  data 
in  the  band  7-13  GHz  with  a  Af  =0.2  GHz.  Figure  2  shows  the  specified  and  estimated  frequency  responses  overlaid. 
Tbe  regression  bandwidth  used  in  obtaining  these  results  is  1  GHz. 

Illustration-2:  Measured  8”  Diameter  Sphere 

The  field  scattered  by  a  sphere  involves  reflection  from  the  specular  point  and  creeping  wave  diffraction.  In  the 
case  of  measured  fields,  there  may  also  be  additional  scattering  mechanisms  associated  with  structures  used  for  mounting 
the  target.  We  process  here,  for  purposes  of  comparison,  the  scattered  field  computed  using  the  Mie  series  [5],  which,  of 
course,  does  not  have  any  spurious  scattering  mechanisms.  Figure  3  shows  the  MUSIC  spectrum  obtained  from  both  the 
measured  and  computed  scattered  fields.  The  offset  between  the  peaks  in  the  two  spectra  is  due  to  the  phase  center 
difference  between  the  8"  sphere  and  its  reference  target  and  is  of  no  consequence.  The  specular  and  creeping  wave 
returns  are  at  -0.65  ns  and  1.32  ns  respectively.  The  round  trip  time  delay  between  specular  and  creeping  wave  returns  is 
given  by. 


At  =  radius  *  (2  +  7t)/c .  3.1 

The  measured  results  give  a  result  of  1.97  ns  and  compares  well  with  the  predicted  value  of  1.74  ns.  The  spectrum  due  to 
measured  data  also  shows,  corresponding  to  a  time  delay  of  -.44  ns,  a  return  due  to  the  support  structure.  The  frequency 
responses  associated  with  these  three  scattming  centers  are  shown  in  figure  4.  The  specular  and  creeping  wave  frequency 
responses  obtained  from  the  computed  data  is  also  included  in  figure  4.  Having  identified  the  scattering  from  the 
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spurious  scattering  center,  it  is  possible  to  edit  it  out  of  the  scattered  field.  This  reconstructed  scattered  field  from 
measured  data  and  the  Mie  solution  are  shown  in  figure  5. 

IUustration-3:  Measured  9.6"  Ogive 

We  next  consider  a  9.6”  ogive,  whose  scattered  field  is  measured  for  the  nose-on  angle  of  incidence.  The 
scattered  field  is  also  computed  using  the  body  of  revolution  code  CICERO  [6].  The  MUSIC  spectra  corresponding  to 
both  sets  of  data  is  shown  in  figure  6.  As  before,  the  offset  in  the  peaks  is  due  to  an  inconsequential  phase  offset  between 
target  and  reference.  In  addition  to  the  scattering  centers  corresponding  to  the  front  and  rear  tips,  measured  data  reveals 
two  additional  scattering  centers.  These  scattering  centers  are  due  to  the  foam  structure  used  to  mount  the  ogive. 
Estimated  frequency  responses  of  all  the  scattering  centers  is  shown  in  figure  7.  The  edited  scattered  field  with  the 
contribution  from  the  two  error  terms  removed,  and  the  scattered  field  computed  by  CICERO  are  overlaid  in  figure  8. 

4.  Conclosions 

This  study  has  presented  a  method  for  extracting  arbitrary-shaped  frequency  dispersions  of  scattering 
mechanisms  from  a  composite  signal.  The  method  demonstrated  how  spurious  scattering  sources  could  be  removed  from 
a  composite  signal  to  provide  the  "true"  scattering  of  a  target.  Unlike  Fourier  methods,  this  technique  performs  its  signal 
reconstruction  in  the  frequency  domain,  thus  avoiding  Fourier  transform  induced  errors  from  multiplicative  time  domain 
gating. 


The  method  showed  minor  degradation  to  the  reconstructed  scattering  of  the  targets  under  study.  This  can  be 
primarily  attributed  to  limiting  the  number  of  scattering  centers  foimd  by  the  MUSIC  algorithm.  By  allowing  MUSIC  to 
search  for  additional  mechanisms,  a  more  accurate  rendering  of  the  actual  scattering  may  be  found.  In  addition,  by 
tailoring  the  assumed  frequency  dispersion  functions  for  individual  scattering  mechanisms,  the  method  will  predict  the 
response  for  each  mechanism  more  accurately. 

The  signal  model  presented  here  assumed  that  the  phase  centers  of  the  mechanisms  under  study  remain^ 
constant  with  frequency.  However,  for  many  targets  such  as  antennas  and  radar  absorbing  materials,  this  assumption  is 
not  valid.  The  method  may  readily  accommodate  this  by  applying  the  MUSIC  algorithm  to  small  sub-bands  to  account 
for  the  frequency  dependent  time  delays,  of  course  at  the  cost  of  resolution. 
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Figure  1.  MUSIC  and  Fourier  Spectra  for  Synthetic  Data  Illustration 
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Figure  2.  Exact  and  Estimated  Frequency  Responses  for  Synthetic  Data  Dlusiration 
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MUSIC  Spectra  for  Measured  Sphere  and  Mie  Series 
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Figure  3.  MUSIC  and  Fourier  Spectra  for  Measured  Sphere 
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Figure  4.  Estimated  Frequency  Response  of  Scattering  Mechanisms  for  Measured  Sphere 
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Reconstructed  Frequency  Response  of  Sphere  and  Mie  Series 


MUSIC  Spectra  for  Measured  Ogive  and  Cicero 
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Figure  7.  Estimated  Frequency  Response  of  Scattering  Mechanisms  for  Measured  Ogive 
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Figure  8.  CICERO  and  Corrected  Frequency  Response  for  Measured  Ogive 
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USING  THE  E-PULSE  TECHNIQUE  AND  HYPOTHESIS  TESTING  TO  PERFORM 
RADAR  TARGET  IDENTIFICATION 


Lloyd  Riggs,  Jon  Mooney,  and  C.  Ray  Smith 
Department  of  Electrical  Engineering 
200  Broun  Hall  •  Auburn  University,  Alabama  36849 


Abstract 

In  this  paper,  the  Extinction  pulse  (E-pulse)  technique  is  combined  with  hypothesis  testing  to 
perform  target  identification.  Based  upon  the  natural  resonances  of  a  conducting  body,  the  E-pulse 
is  constructed  to  annihilate  the  natural  modal  content  of  the  scattered  field  response  from  a  specific 
target.  Convolving  the  same  E-pulse  with  the  impulse  response  from  a  dissimilar  target  yields  a  nonzero 
response.  Thus,  the  E-pulse  scheme  reduces  the  identification  process  to  discriminating  between  no 
response  and  a  “large”  response.  However,  in  a  real  environment  where  noise  will  inevitably  corrupt 
the  radar  signature,  the  robustness  of  the  E-pulse  technique  is  diminished.  Convolving  the  E-pulse 
with  the  signature  from  it’s  intended  target  in  the  presence  of  noise  will  not  produce  a  null  late-time 
response.  Although  the  E-pulse  scheme  definitely  has  merit,  it  alone  cannot  completely  resolve  the 
target  ID  problem,  especially  under  the  infiuence  of  such  nuisance  parameters  as  noise  and  aspect 
angle.  One  possible  method  of  overcoming  these  parameters  while  retaining  the  E-pulse  approach  is 
to  conduct  a  binary  hypothesis  test  at  the  output  of  the  E-pulse  filter.  Such  a  test  will  yield  a  measure 
of  belief  (a  probability)  as  to  whether  the  output  is  either  “noise”  (correct  target)  or  “signal  plus 
noise”  (wrong  target).  This  paper  will  expound  on  the  rationale  of  using  the  E-pulse  technique  and 
binary  hypothesis  testing  to  perform  target  identification. 


I  Introduction 

The  E-pulse  radar  target  discrimination  technique  is  based  on  the  singularity  expansion  method  (SEM) 
representation  of  the  transient  scattered  field  returned  from  a  target  which  has  been  illuminated  by  an 
“impulsive”  (wide  bandwidth)  radar  pulse.  The  SEM,  as  formulated  by  Baum  in  1971  [1],  represents 
electromagnetic  interaction  or  scattering  in  terms  of  simple  pole  terms  (or  singularities)  in  the  complex 
frequency  plane,  or  correspondingly,  as  simple  damped  sinusoids  in  time  domain.  For  example,  the  SEM 
representation  of  the  late-time-scattered  field  impulse  response  of  a  conducting  object  may  be  written  as 

N 

^(0  =  m  e'"”*  cos(a>„t  -h  ^„),  i  >  Tl  (1) 

Tl=l 

where  the  amplitude  and  phase  of  the  nth  mode,  and  ,  depend  on  the  orientation  of  the  target  with 
respect  to  the  radar  (aspect  dependent  parameters)  while  Wn  and  (<  0)  are  the  aspect-independent 
frequency  and  damping  constant  of  the  nth  mode,  respectively,  and  only  N  modes  are  assumed  excited 
by  the  incident  field  waveform.  Late  time,  denoted  by  t  >  Tc  ,  is  defined  as  the  time  period  after  the 
incident  pulse  has  passed  over  the  target,  so  that  subsequent  radiation  is  associated  with  the  target’s 
free  natural  resonances  [2].  Equation  (1)  is  constructed  using  what  is  referred  to  as  a  class  I  coupling 
coefficient  [3].  An  SEM  representation  for  the  scattered  field  employing  a  class  II  coupling  coefficient 
may  also  be  constructed  and  enjoys  the  advantage  of  greater  accuracy  than  the  class  I  form  in  early  time, 
0  <  t  <  Tx,  ,  albeit  at  the  expense  of  greater  complexity  [4]. 
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The  E-ptilse  and  related  single  mode  extraction  pulse  (S-pulse),  are  purported  to  be  aspect  indepen¬ 
dent  discriminatory  waveforms  which,  when  convolved  with  the  late-time  response  of  a  matched  target, 
produce  null  or  single-mode  responses,  respectively.  When  an  E-pulse  waveform  tailored  to  one  target  is 
convolved  with  the  late-time  response  for  a  different  target,  a  nonzero  response  results.  Complete  details 
regarding  the  synthesis  of  E-  and  S-pulse  waveforms  will  not  be  discussed  here.  Instead,  the  interested 
reader  is  referred  to  a  review  article  on  the  subject  by  Baum  et.  al  [5].  A  simple  example  will  suffice  in 
providing  the  reader  with  an  adequate  understanding  of  the  E-pulse  discrimination  technique  relevant  to 
this  discussion. 

Figure  1  presents  a  block  diagram  of  a  system  designed  to  discriminate  between  a  pair  of  targets  using 
the  E-pulse  technique.  For  simplicity,  only  the  two  target  case  will  be  considered  since  generalization  to 
the  N-target  case  is  straightforward.  Designate  target  A  as  the  thin  cylinder  and  target  B  as  the  swept 


Figure  1:  Block  diagram  of  a  system  to  discriminate  between  targets  A  and  B  using  the  E-pulse  method. 

wing  aircraft.  A  suitable  wide  bandwidth  source  illuminates  the  unknown  target  (either  A  or  B)  and 
the  scattered  field  from  the  target  is  received  and  convolved  with  the  E-pulse  matched  to  target  B,  or 
equivalently,  as  depicted  in  Figure  1,  the  return  is  multiplied  (frequency  domain)  by  the  E-pulse  filter 
matched  to  target  B.  (Note  that  an  E-pulse  filter  matched  to  target  A/B  will  have  zeros  at  positions 
in  the  complex  plane  where  the  impulse  response  function  of  target  A/B  has  poles.  See  Eq.  (1)  above.) 
Figure  2a  shows  that  in  late  time  a  null  signal  results  at  the  output  of  the  E-pulse  filter  matched  to 
target  B  if  target  B  is  indeed  responsible  for  generating  the  return.  In  contrast.  Figure  2b  shows  that 
if  target  A  were  the  true  target  present  an  oscillatory  nonzero  return  is  observed.  In  short,  a  late-time 
nonzero  output  from  the  E-pulse  filter  matched  to  target  B  indicates,  for  this  simple  binary  discrimination 
problem,  that  target  A  is  present. 

Several  additional  comments  pertaining  to  the  results  in  Figure  2  are  in  order  before  proceeding. 
First,  referring  to  Figure  2a  one  may  observe  that  the  late-time  output  from  the  filter  is  zero  when  target 
B  is  present  independent  of  the  orientation  of  target  B  with  respect  to  the  transmitting  antenna.  When 
noise  is  present,  which  will  be  the  case  for  all  practical  systems.  Figure  2c  indicates,  as  expected,  that  the 
late-time  output  from  the  filter  is  just  noise  when  target  B  is  present,  and  this  noise  is  independent  of  the 
orientation  of  target  B.  On  the  other  hand,  when  target  A  is  present  the  output  from  the  filter  will,  in 
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general,  consist  of  a  target-orientation-dependent  signal  (see  Figure  2b),  or  when  noise  is  also  present,  a 
target-orientation-dependent  signal  plus  noise  (see  Figure  2d).  In  summary,  binary  target  discrimination 
using  an  E-pulse  filter  matched  to  target  B  requires  one  to  decide  between  two  distinct  conditions: 

1.  Only  noise  is  observed  in  which  case  the  declaration  is  made  that  target  B  is  present. 

2,  A  target-orientation-dependent  signal  plus  noise  is  present  in  which  case  the  declaration  is  made 
that  target  A  is  present. 


4  6  8 

Normalized  Time  (ct/L) 


Figure  2:  a.)  Impulse  response  of  target  B  filtered  with  the  E-pulse  for  target  B.  b.)  Impulse  Response 
of  target  A  filtered  with  the  E-pulse  for  target  B.  c.)  Impulse  Response  of  target  B  with  15%  random 
noise  filtered  with  the  E-pulse  for  target  B.  d.)  Impulse  Response  of  target  A  with  15%  random  noise 
filtered  with  the  E-pulse  for  target  B. 
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II  Hypothesis  Testing 

In  this  section  the  hypothesis  testing  problem  outlined  in  the  previous  paragraph  will  be  examined  in 
greater  detail:  that  is,  we  seek  to  specify  a  decision  rule  based  on  the  data  stream  at  the  output  of  the 
E-pulse  filter  which  wdU  enable  us  to  ’’determine”  which  of  the  two  possible  targets  is  present. 

According  to  Van  Trees  [6],  there  are  four  components  to  the  hypothesis  testing  problem.  First,  we 
have  a  source  w’hich  generates  an  output,  the  output  is  one  of  two  choices,  and  these  choices  are  referred 
to  as  hypotheses.  The  source  (of  data)  here  is  the  transient  radar  return  from  one  of  the  targets  and  we 
define  the  mutually  exclusive  and  exhaustive  hypotheses: 

Ha  :  target  A  generated  the  radar  return. 

Hb  ;  target  B  generated  the  radar  return. 

In  terms  of  the  impulse  response  functions  of  targets  A  and  B  one  may  write: 

Ha:  yp.{t,e)  =  J  TA{t-t')f{t')dt'  (2) 

Hb  :  =  J  rs(t- (3) 

where  yA{i,  and  are  the  aspect  dependent  returns  from  target  A  and  B  respectively  expressed 

in  terms  of  a  convolution  of  the  incident  waveform,  /(f),  with  the  impulse  response  function  associated 
with  the  targets  rA{i,d)  and  rB(f,  ^).  It  should  be  pointed  out  that  although  r{t,9)  is  referred  to  as  the 
impulse  response  function  of  the  target  (either  A  or  B)  it  might  just  as  well  represent  the  effect  of  many 
system  components  in  cascade: 

r{t,  9)  =  hi{t)  *  /12(f)  *  hsit) ...  (4) 

(*  denotes  convolution)  where  /2.1(f)  could  be  the  impulse  response  of  the  transmitting  antenna,  /12(f) 
could  represent  the  impulse  response  of  the  target,  hz{t)  that  of  the  receiving  antenna,  etc.  Of  course  if 
the  impulse  response  of  the  E-pulse  were  included  in  (4),  then  the  output  in  the  absence  of  noise  would 
be  either  zero  or  a  target-aspect-dependent  signal  as  discussed  above. 

The  second  component  of  the  problem  is  a  probabilistic  transition  mechanism,  and  the  third  is  an 
observation  space.  Probability  enters  the  problem  because  of  an  unavoidable  noise  component  which  is 
added  to  the  radar  return.  When  the  E-puIse  filter  is  employed  one  cannot  say  with  complete  confidence 
that  the  output  is  just  noise  (Hb)  or  signal  plus  noise  (Ha).  Each  target  return  is  mapped,  via  the 
addition  of  noise,  to  some  point  in  an  observation  space.  The  observation  space  can  be  view'ed  as  the  set 
of  all  possible  signals  at  the  output  of  the  E-pulse  filter.  The  fourth  and  final  component  of  the  detection 
problem  is  a  decision  rule.  After  making  an  observation,  a  decision  is  made  as  to  which  hypothesis  is 
true.  Such  a  decision  results  in  the  assignment  of  each  point  in  the  observation  space  to  one  of  the 
hypotheses. 

Assuming  observations  are  made  at  the  output  of  the  E-pulse  filter  matched  to  target  B,  in  the 
presence  of  noise,  the  hypotheses  may  be  rewritten  as 

Ha:  d{t,9)  =  yxit^O)  +  n{t) 

Hb  :  d(f :  9)  —  n{t) 

where  d(t,  9)  is  the  data  stream,  and  we  take  the  probability  of  a  given  noise  function  n(f)  to  be 


166 


(zero  mean  Gaussian  noise)  and  <7^  is  the  noise  power.  Note  that  p(n{i)jo-)  is  read:  the  probability  of 
observing  a  given  noise  signal  n.(/)  given  the  noise  power  .  Now  if  target  A  is  the  true  target  present, 
then  the  probability  of  getting  a  given  data  function  d{t,  9)  can  be  found  by  solving  for  the  noise  n{t)  in 
terms  of  yAiiJ)  and  d{t,0)  and  substituting  into  Eq.  (5) 

p(D|A,0,ct)  =  -^;^exp|-~  j  [d(t,9)~yAii,0)]'^  ■  (6) 

Once  again,  (6)  is  read;  the  probability  of  getting  a  given  data  function  (D)  given  that  target  A  is 
present  at  an  orientation  angle  9  with  respect  to  the  transmitting  antenna,  and  the  noise  power  is 
Equations  (5)  and  (6)  are  referred  to  as  the  sampling  probabilities  for  the  problem  and  play  a  pivotal 
role  in  constructing  the  desired  decision  rule. 

Ill  Constructing  a  Decision  Rule  (Bayes  Criterion) 

In  the  binary  hypothesis  problem,  we  know  that  either  Ha  or  Hb  is  true,  and  so  making  a  decision  can 
result  in  only  four  possible  outcomes: 

Target  A  is  chosen,  and  target  A  is  the  true  target. 

Target  A  is  chosen,  and  target  B  is  the  true  target. 

Target  B  is  chosen,  and  target  B  is  the  true  target. 

Target  B  is  chosen,  and  target  A  is  the  true  target. 

Obviously,  the  first  and  third  alternatives  are  correct  choices,  and  the  second  and  fourth  are  incorrect 
choices.  In  general,  one  would  expect  that  the  decision  rule  (the  method  used  to  process  the  data)  to 
depend  upon  the  relative  importance  of  the  four  possible  outcomes.  Costs  ,  Cjj  ,  are  therefore  assigned  to 
each  of  the  four  possible  outcomes  with  Cy  representing  the  cost  of  choosing  hypothesis  i  when  hypothesis 
j  is  actually  true.  One  would  assume  that  an  incorrect  decision  would  exact  a  greater  toll  (have  a  greater 
cost)  than  a  correct  decision  so  that  Caa  <  Cab  and  Cbb  <  Cba  • 

A  Bayesian  decision  rule  [6]  assumes  that  the  source  outputs  are  governed  by  probability  assignments 
Pa  and  Pb  which  are  referred  to  as  prior  probabilities  and  represent  the  state  of  knowledge  about  the 
outcome  (which  target  is  present)  before  the  data  is  received.  In  the  absence  of  prior  knowledge,  one 
would  assign  Pa  =  Pa  =  |-  Our  decision  rule  should  be  constructed  so  that  on  the  average  the  cost  will 
be  as  small  as  possible.  It  certainly  seems  intuitively  correct  that  the  risk  (the  average  value  of  the  total 
cost)  should  be  proportional  to:  (1)  the  individual  cost  Cy,  (2)  the  probability  that  target  j  is  present 
before  getting  any  data  (prior  information)  Pj,  and  (3)  the  probability  that  we  declare  that  target  i  is 
present  given  that  target  i  is  actually  present.  Taking  these  three  factors  into  account,  an  expression  for 
the  risk  (total  average  cost)  becomes 

3?  =  CAAPAP(say  Aj  A,  ^,o-)  +  CABPBP(say  A1B,5,(7) + 

CBBPBP(say  B  I  B,  a)  +  CBAPAP(say  B  |  A,  0,  a)  (7) 

which  can  be  viewed  as  a  rule  for  dividing  the  observation  space  into  regions  Za  (where  we  declare  that 
target  A  is  present)  and  Zb  (where  we  declare  that  target  B  is  present). 
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Assuming  that  the  observation  space  has  been  divided  into  regions  Za  and  Zb  the  risk  can  be  written 
in  terms  of  the  previously  defined  transition  probabilities  (Eqs.  (5)  and  (6))  as 


=  CaaPa  /  p(D  I  +  CabPb  /  KD  I  + 

JZx  JIk 

CbbPb  f  p(D  I  B,  0,  (7)dD  +  CbaPa  /  p(D  [  A,  0,  a)<iD. 

JZ-R  JZ-R 


(8) 


Since  the  entire  observation  space  is  Z  =  Za  +  Zb  the  integrals  over  Zb  can  be  replaced  with  integrals 
over  Z  —  Za,  and  additionally,  the  probability  density  functions  must  have  the  property  that 


f  p(D|  A,0,£7)dD  =  /  p(D|B,0,fT)dD  =  1. 

JZ  Jz 

Using  Eq.  (9)  in  Eq.  (8)  allows  one  to  express  the  risk  as 

=  PaCab  +  PbCbb  +  /  {PB(CAA-CBB)pfD|B,0,cr)  - 
JZa 

PA(CBA-CAA)p{DiA,0,a)}dD. 


(9) 


(10) 


The  first  two  terms  in  Eq.  (10)  represent  the  fixed  cost.  As  mentioned  above,  the  cost  of  making  an 
incorrect  declaration  is  greater  than  the  cost  of  making  a  correct  one,  or  Cab  >  Cbb  and  Cba  >  Caa- 
Thus,  the  terms  in  parentheses  in  Eq.  (10)  must  be  positive.  Furthermore,  since  the  probability  density 
functions  in  Eq.  (10)  are  positive,  minimizing  the  risk  requires  that  Za  be  that  portion  of  the  observation 
space  which  satisfies 


Pa(Cba  -  Caa)KD  I  A,  0,  ^r)  >  Pb(Caa  -  Cbb)KD  !  B,  9,  a).  (11) 


In  other  words,  if,  after  receiving  the  data,  the  inequality  in  Eq.  (11)  holds,  then  we  declare  that  target  A 
is  present.  If  the  inequality  is  violated,  then  we  declare  that  target  B  is  present.  Alternatively,  Eq.  (11) 
may  be  manipulated  to  yield 

KD|A,^,,t)  Pb(Cab  -  Cbb)  ,, 

y(D|B,e.<7)  ,<  Pa(Cba-Caa) 

where  the  quantity  on  the  left  is  called  the  likelihood  ratio  and  that  on  the  right  the  threshold.  Equa¬ 
tion  (12)  is  the  desired  decision  rule  and  is  often  called  a  threshold  test.  Our  data  processor  evaluates 
the  likelihood  ratio,  and  if  this  likelihood  ratio  exceeds  the  threshold,  then  we  declare  that  hypothesis 
Ha  is  true.  Otherwise,  if  the  likebhood  ratio  is  less  than  the  threshold,  then  hypothesis  Hb  is  declared 
true. 


IV  Threshold  Testing  The  Output  of  the  E-pulse  Filter 

In  this  section  the  threshold  test  is  examined  for  the  case  of  uniform  priors  and  uniform  cost.  In  addition, 
the  dependence  of  the  threshold  test  on  target  aspect  is  considered. 

The  assumption  of  uniform  priors  implies  that  Pa  =  Pb  =  |  (before  getting  any  data  target  A  is 
just  as  likely  to  be  present  as  target  B),  and  if  the  cost  are  stiU  assumed  uniform,  Caa  =  Cbb  =  0  and 
Cab  =  Cba,  then  the  threshold  becomes  unity.  If  target  A  is  present,  then  the  probability  of  observing 
a  given  data  function  D  is  given  by  Eq.  (6).  On  the  other  hand,  if  target  B  is  present,  the  probability  of 
getting  a  given  data  function  becomes 
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where  Eq.  (5)  has  been  used  with  the  noise  replaced  by  the  data  which  is  true  when  target  B  is  present. 
Substituting  Eqs.  (5)  and  (13)  into  Eq.  (12)  and  taking  the  logarithm  of  both  sides  allows  the  threshold 
test  to  be  written  as 

r  \  r 

j  d(t)yK(t,e)dt  %  ^JviMdt.  (14) 


Note  that  in  Eq.  (14)  the  orientation  of  the  target,  0,  is  assumed  known.  The  new  form  of  the  decision 
rule  in  Eq.  (14)  is  based  on  a  comparison  of  energies.  The  right  hand  side  of  Eq.  (14)  is  just  one  half  the 
total  late  time  energy  in  the  signal  at  the  output  of  the  E-pulse  filter  when  target  A  is  present.  If  target 
A  is  actually  present,  the  left  hand  side  of  Eq.  (14)  should  exceed  the  threshold  which  is  certainly  true 


j  d{i)yA(t,9)dt  =  j  \y„(tM  dt  +  jn{t)yi,(t,e)dt, 


(15) 


and  the  right  most  integral  is  zero  since  the  noise  is  as  likely  to  be  positive  as  negative.  On  the  other 
hand  if  target  B  is  present  then  the  output  of  the  E-pulse  filter  will  just  be  noise,  d{t)  =  n(t),  and  the 
left  hand  side  of  Eq.  (14)  has  an  expected  value  of  zero  which  is  less  than  the  threshold. 

As  mentioned  above,  Eq.  (14)was  derived  under  the  assumption  that  the  target  aspect,  9,  is  known. 
When  this  is  not  the  case,  9  is  referred  to  as  a  nuisance  parameter  [7]  or  unwanted  parameter,  and  a 
technique  known  as  marginalization  must  be  employed.  In  effect,  marginalization  allows  one  to  take  into 
account  all  possible  target  orientations  as  demonstrated  below. 

According  to  Eq.  (6),  p(D  j  A,  9,  a)  represents  the  probability  of  getting  a  data  set  D  given  that  target 
A  is  present  at  an  orientation  angle  9  with  noise  power  a^.  However,  if  the  dependence  on  target  aspect 
is  to  be  eliminated  we  actually  desire  p{D  |  A,  cr),  which  is  obviously  independent  of  9.  The  latter  may 
be  obtained  from  the  former  by  applying  what  essentially  amounts  to  the  elementary  sum  and  product 
rules  of  probability  theory,  that  is 


P(D|A,c7)=  jp{D,9\A,a)de  =  J  p{D\  A,9,ct)p{9\  A,a)d9. 


(16) 


As  pointed  out  by  Jaynes  [7],  “This  is  a  weighted  average  of  all  possible  values  of  p(D  |  A,  9,  cr),  weighted 
according  to  the  prior  probability  p{9\A,  cr).”  Of  course  Eq.  (16)  cannot  be  evaluated  until  the  prior  for 
the  target  orientation,  p{9  |  A,cr),  has  been  specified. 

As  an  example,  assume  a  uniform  probability  distribution  for  the  target  in  the  form 


p(D  I  A,  cr) 


0  elsewhere. 


(17) 


Now  using  Eqs  (17)  and  (6)  in  Eq.  (16)  yields 

P(»l  A,-)  =  exp{-^/  [i(t,9)  -  dt]  de.  (IS) 

The  reader  should  be  aware  that  other  probability  density  functions  for  the  target  angle  9  may  also 
be  appropriate.  In  fact,  if  the  target  orientation  is  known,  by  some  other  means,  to  be  within  a  small 
angular  sector  about  9  then  Eq.  (18)  is  evaluated  by  sampling  the  integrand  at  that  value  of  9  and  the 
previous  form  of  the  ratio  test  given  by  Eq.  (15)  results.  If,  on  the  other  hand,  very  little  knowledge  on 
the  target  orientation  is  available,  then  the  integral  in  Eq.  (18)  might  extend  over  a  wide  range  so  that 
all  the  ^-dependent  data  would  be  taken  into  account.  Using  Eqs.  (18)  and  (5)  in  Eq.  (12)  results  in 
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the  final  form  for  the  ratio  test  taking  into  account  all  the  data  for  target  orientations  within  the  range 
9i  6  <i  $2 

CT  Ih  {-  5^  J  ld(t,  e)  -  i,A  (i,  6)fdt]de 
V  Concluding  Remarks 

In  this  paper  an  overview  of  the  E-pulse  target  discrimination  technique  has  been  provided.  To  the  best 
of  our  knowledge,  this  work  represents  the  first  time  that  the  E-pulse  identification  scheme  has  been 
investigated  from  the  point  of  view  of  probability  theory.  We  point  out  that  the  E-pulse  and  related 
S-pulse  target  identification  schemes  have  been  purported  not  to  depend  upon  target  orientation.  The 
reason  for  this  assumption  is  based  on  the  fact  that  when  a  target  is  present  the  output  of  the  E-pulse 
filter  matched  to  that  target  is  indeed  noise,  independent  of  the  matched  target’s  orientation.  However, 
when  probability  theory  and  standard  hypothesis  testing  techniques  are  applied  to  the  problem,  target 
orientation  must  once  again  be  considered.  The  reason  is,  as  discussed  above,  the  output  of  the  E-pulse 
filter  when  the  “incorrect”  target  is  present  is  aspect  dependent.  One  reason  for  the  confusion  may  be 
related  to  the  erroneous  assumption  that,  in  general,  the  E-pulse  scheme  allows  one  to  compare  a  large 
signal  (target  not  matched  to  the  filter)  to  a  small  signal  (noise),  matched  case.  This  condition  may  in 
fact  prevail,  but  only  in  high  signal-to-noise  ratio  scenarios.  (If  the  signal-to-noise  ratio  is  large  enough, 
in  may  be  more  convenient  to  carry  out  identification  visually!)  When  the  signal-to-noise  ratio  begins  to 
fall  the  matched  response  (noise)  will,  to  the  human  operator,  began  to  closely  resemble  the  unmatched 
response  and  one  must  resort  to  more  appropriate  means  of  signal  detection  (e.g.  probability  theory). 
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A  BOUNDARY-INTEGRAL  CODE  FOR  ELECTROMAGNETIC  NONDESTRUCTIVE 
EVALUATION 
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4635  Morningside  Drive 
Bloomington,  IN  47408 

INTRODUCTION 

Volume-integral  equations  have  proven  to  be  very  successful  in  the  computation  of  eddy-current 
probe-flaw  responses  for  nondestructive  evaluation  (NDE)  problems  having  a  number  of  simple 
geometries.  This  approach  to  NDE  computations  has  proven  superior  to  the  finite-element  approach 
in  both  accuracy  and  computer  resources  required,  and  is  the  basis  of  our  proprietary  code  VIC- 
3D^  The  volume-integral  approach,  however,  is  not  as  weU  adapted  to  accommodating  the  complex 
geometries  sometimes  required  in  practical  applications.  An  example  is  the  separation  of  edge  and 
corner  effects  from  the  response  of  a  flaw.  We  will  discuss  an  extension  of  the  volume-integral 
approach  that  incorporates  boundary-integral  equations  to  provide  a  description  of  complicated 
surface  geometries. 

BACKGROUND 

Requirements  for  eddy-current  probe  simulation  software  for  the  aerospace  industry  include 

1.  Coil  Modeling: 

.1  conventional  absolute  and  differential  coils  (side  mount  and  end  mount), 

.2  single  eddy-current  array  probe  (ECAP)  coils  (air  core  and  ferrite  backed), 

.3  ECAP  arrays  (air  core  and  ferrite  backed),  including  interactions  between  coils. 

2.  Part  Geometry  Modeling  (aircraft  engine  components): 

.1  sharp  edges, 

.2  chamfered  and  filleted  edges, 

.3  bolt  holes, 

.4  dovetail  contours. 

3.  Defect  Modeling: 

.1  EDM  notches, 

.2  idealized  cracks  away  from  edges, 

.3  idealized  cracks  at  edges, 

.4  pits  and  other  geometrically  defined  defects. 

4.  Coil  Surface  Interaction: 

.1  compute  the  complex  impedance  due  to  coil  surface  interaction 

^VIC-3D  is  a  registered  trademark  of  Sabbagh  Associates,  Inc. 
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Figure  1:  Electromagnetic  NDE  problem  to  be  solved.  Region  1  contciins  the  probe  coil,  and  region 
2  the  workpiece  (or  host)  and  flaw. 

.2  compute  lift-off  resulting  from  any  combination  of  coil  and  geometry  models 
.3  calculate  complex  impedance  as  a  function  of  drive  frequency. 

VIC-3D  already  solves  most  of  these  problems  quite  efficiently  using  volume-integral  equations 
[1-3].  We  propose  to  couple  boundary-integral  equations  to  the  volume-integral  equations  to  en¬ 
hance  VIC’s  ability  to  solve  the  complex  geometries  of  item  2,  while,  at  the  same  time,  distinguishing 
flaws  at  edges  and  corners,  as  described  in  item  3. 

THE  TWO  SYSTEMS  OF  COUPLED  INTEGRAL  EQUATIONS 

We  wiU  derive  the  equations  for  the  genercd  electromagnetic  NDE  problem,  which  consists  of 
two  regions,  as  in  Figure  1.  The  first  is  the  exterior  of  the  workpiece,  contains  the  probe  coil, 
and  consists  of  free-space,  whereas  the  second  is  the  finite  workpiece  which  contains  a  flaw.  The 
boundary  of  region  2  is  labeled  S.  The  “incident”  fields  established  by  the  volume  currents  produce 
“scattered”  fields  at  boundary  S.  We  want  to  calculate  the  scattered  fields  by  using  boundary- 
integral  equations. 

We  use  the  equivalence  principle  of  Figure  2  to  determine  various  boundary-integral  equations. 
The  electric  surface  current,  J^,  and  magnetic  surface  current,  Kg,  of  Figure  2(b)  are  fictitious 
sources  for  the  scattered  and  transmitted  fields,  and  Sj,  serve  the  same  purpose 

for  in  Figure  2(c).  The  “transmitted”  fields,  those  with  the  superscript  t,  are  auxiliary 

fields,  whereas  the  “scattered”  fields,  those  with  the  superscript  s,  are  actual.  We  define  the  surface 
currents  by  =  a„  x  H,  and  =  -an  x  E. 

One  can  derive  a  host  of  boundary-integral  equations  by  choosing  the  auxiliary  fields  appropri¬ 
ately.  For  example,  if  we  let  a„  x  Ef^  =  0,  an  x  E^*^  =  0,  a„  x  =  0,  a„  x  =  0,  which 
is  Love’s  equivalence  theorem,  then  it  can  be  easily  shown  that  =  -3^  -  J*,  Kg  =  ~Kd  -  Kg. 
Using  these  facts  we  can  derive  the  following  system: 

ju;(€2  +  eo)Kg(r)  =  2ya;an  X  (coE[’^  -  €2E^‘^) 
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Figure  2:  Equivalence  principle,  (a)  Original  problem,  (b)  External  equivalence  case,  (c)  Internal 
equivalence  case. 


-an  X  /  [3s{-k2<i>2  +  +  JwKs  X  V'(e2<?i>2  -  •  JsV'(^2  -  4>i)]dS'  (1) 

23r  Js* 

jLo{fio  +  ;i2)Js(r)  =  jw2an  X 

+a„  X  7^  /  [Ks(-^:202  +  ^o<^l)  -  X  V'(/i2<?5’2  -  fio<f>i)  +  V'  •  KaV\<l>2  -  <f>l)]dS' 

2jr  Js* 

(2) 

J,(r)-cr.(r)  /  ^rWr')  •  J.(rOrfv' = 

./flaw 

-cra(r)^  f  [j(ji>fi23s4>2(r\r')  +  X  V'<^2(r|r')  +  *  JiV'<?i2(r|r')]<i5''  .  (3) 

47r  Js 

A  second  system  of  boundary-integral  equations  can  be  obtained  by  letting  the  tangential  com¬ 
ponents  of  a„  X  =  an  X  which  impEes  that  Ke  =  0,  as  can  be  seen  from  Figure  2(b). 
Similarly,  let  an  xE^‘^  =  an  xE|*\  which  means  that  Kd  =  0.  Hence,  the  magnetic  surface  currents 
vanish,  and  we  are  left  with  only  electric  surface  currents  to  be  determined.  The  equations  for  these 
currents,  Jg,  Jj,  are  obtained  by  returning  to  Figure  2(a),  and  equating  the  tangential  components 
of  the  total  electric  and  magnetic  fields.  The  result  is 


2a„  X  (nf  -  H«)  = 


A  /  +  jAv'  •  JjVV2(r|r')|  rfS'l  (4) 

4'!r  Js  L  J<^€2  J  /tan 

J,  -  »»(>•)  X  (J.  ^  VViCrlr')).;^' 

Jj+ij(a,(r)x(JrfXVV2(r|r'))<;5'  (5) 
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Ja(r)  -  <7.(r)  sr{t\r')  ■  J,(r')<lf'  =  ^  [ja./x,Ji*(r|r')  +  •  JjV'A!(r|r')]  dS' 

(6) 

where  r  €  5  in  the  boundary-integral  operators  of  (1),  (2),  (4),  and  (5),  V'*  is  the  surface 
divergence,  and  fg,  denotes  the  principal  value.  The  scalar  Green’s  functions  are 

g-jio!r-r'|  e-ifc2lr-r'| 

^i(r,r')=  .  h(ry)=  ,  (7) 

and  kl  =  w^/zoCo,  ^2  =  ^2  +  and  /Z2,  h  are  constants,  o'*  is  the  conductivity 

of  the  workpiece,  which  is  taken  to  be  homogeneous,  and  the  anomalous  conductivity  is  defined  to 
be  i7a(r)  =  <7(r)  —  a^.  Note  that  cr„  vanishes  off  the  flaw. 

System  (1)  and  (2)  is  a  modest  extension  of  the  system  that  appears  in  [4],  in  that  our  system 
contains  terms  corresponding  to  the  presence  of  the  incident  fields  in  region  2.  Poggio  and  Miller 
[5j  have  developed  a  system  for  and  J,s,  but  their  system  is  of  the  first  kind,  whereas  (1)  and 
(2)  are  of  the  second  kind.  Systems  of  the  second  kind  usually  result  in  better  conditioned  matrix 
equations,  which  provides  significant  advantages  in  numerical  computations. 

System  (4)  and  (5),  like  the  previous  one,  has  four  surface  current  components  as  unknowns. 

It  consists  of  a  first-kind  integral  equation,  (4),  and  a  second-kind  integral  equation,  (5).  The  first 
equation  is  the  electric- field  integral  equation  (EFIE),  and  the  second  the  magnetic-field  integral 
equation  (HFIE).  EFIE’s  and  HFIE’s,  as  individual  equations,  are  well  known  and  understood  in 
the  computational  electromagnetics  community  [5].  MacCamy  and  Stephan  [6],  and  Wang  and  Ida 
[7]  have  used  a  similar  system  to  solve  eddy-current  problems. 

In  both  of  these  systems,  Ja(r)  is  the  anomalous  current  density  that  is  associated  with  the 
flaw.  Equations  (3)  and  (6)  are  volume- integral  equations  for  the  anomalous  current.  These  same 
equations  (with  a  different  right-hand  side,  of  course)  are  already  solved  very  efficiently  in  VIC-3D. 

The  incident  fields  in  region  1  are  due  to  the  exciting  coil.  Hence,  we  have 


ES’'’(r)  =  /  er(r|r')-J.(r')<i*;'  (8) 

./coil 

Hf'(r)  =  f  erirW)J.(r')dv',  (9) 

i/coil 

whereas  those  in  region  2  are  given  by  integrals  over  the  region  of  the  flaw: 

E«(r)  =  /  er(r|r')-Ja(r')<io'  (10) 

./flaw 

H«(r)  =  [  5?'(r|r')-J„(rV.'.  (11) 


Q  is  a.  dyadic  Green’s  function  whose  subscript  defines  its  region,  and  whose  superscripts  denote 
its  type.  The  first  superscript  denotes  the  nature  of  the  field  at  r,  e  being  electric,  and  m  magnetic, 
whereas  the  second  superscript  denotes  the  nature  of  the  source  at  r'.  Oi  is  determined  from  0i, 
and  O2  from  (f>2. 

The  integral  equations  are  discretized  by  means  of  Galerkin’s  variant  of  the  method  of  moments. 
The  anomalous  current,  Ja(r),  is  expanded  in  three-dimensional  pulse  functions  that  are  defined 
on  a  regular  grid,  and  the  resulting  equation  is  then  tested  with  the  same  functions.  This  results 
in  a  matrix  equation  in  which  the  matrix  is  Toeplitz  in  all  three  dimensions: 

G{jlm,JLM)  =  -  L,m  -  M,j  -  J)  .  (12) 
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Figure  3:  Eight-node  serendipity  element. 

This  feature  gives  VIC-3D  its  ability  to  solve  very  large  problems  efficiently  on  personal  computers 
with  a  minimum  of  storage.  (We  have  solved  problems  with  50,000  unknowns  in  a  day,  using  less 
than  20MB  on  a  personal  computer.) 

DISCRETIZATION  OF  THE  BOUNDARY-INTEGRAL  OPERATORS 

The  discretization  of  the  boundary-integral  operators  also  uses  Galerkin’s  variant  of  the  method 
of  moments,  except  that  the  unknown  surface  currents  are  expanded  in  vector  edge-elements  that 
are  deiined  on  a  surface  mesh  [12],[13].  These  elements  preserve  continuity  of  normal  components 
of  the  currents  across  edges  of  the  mesh,  thereby  guaranteeing  that  the  unknown  currents  belong 
to  H{div),  the  space  of  square-integrable  functions,  whose  divergences  are  also  square-integrable. 

We  have  developed  a  theory  of  higher-order  edge-elements  (i.e.,  elements  that  interpolate  to 
higher-order  than  linear),  which  is  implemented  using  the  eight-node  serendipity  element  of  finite- 
element  theory  [8].  The  element  is  shown  in  Figure  3,  and  the  interpolating  polynomials  are  listed 
in  (13). 

Eight— node  serendipity  quadrilateral : 

>;)  =  1(1  -  f)(i  -  -  (1  -  e')(i  -  >7)  -  (1  -  >7")(i  -  01/4 

M(,  v)  =  1(1  +  0(1  -  -  (1  -  0)(i  -  -  (1  -  >)0(i  +  01/4 

WO  0  =  1(1  +  0(1  +  0  -  0  -  0)(i  +  0  -  (1  -  >;0(i  +  01/4 

W0>7)=  1(1 -0(1  + O-  (1-0X1  + >!)-(l  -770(1-01/4  (13) 

V>5(0>7)=(1-0)(1->7)/2 
woo  =  (1-770(1 +  0/2 
WO>7)  =  (1-0)(1  +  i7)/2 
W077)  =  (1-770(1  -0/2. 

One  can  expand  both  the  geometry  of  the  workpiece  and  the  surface  currents  using  (13)  for 
interpolation,  which  makes  this  an  isoparametric  element.  If,  however,  the  order  of  the  geometric 
shape  function  is  greater  than  the  order  of  approximation  of  the  fields,  then  the  geometric  elements 
are  called  superparametric;  conversely,  the  geometric  elements  are  called  subparametric  if  their 
order  is  less  than  that  of  the  field  shape  elements  [9].  There  are  advantages  to  using  different 
orders  of  approximation  of  the  geometry  and  fields.  Ingber  and  Ott  [9]  have  gotten  good  results 
when  approximating  the  geometry  to  second  order,  while  approximating  the  currents  with  linear 
shape  functions.  This  means  that  there  may  be  fewer  variables  required  to  represent  the  field  than 
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to  represent  the  geometry,  with  a  consequent  reduction  in  computer  resources  to  solve  for  the  fields. 
The  interpolating  vector-basis  functions  are  given  by 


fji  =  v) 
ii2  = 


(14) 


an  •  (a^  X  Sirj) 


Tr)  ■ 


The  vectors  and  a,  are  the  unit  tangent  vectors  to  the  surface  along  the  ^  and  i]  directions, 
while  a-n  is  the  unit  normal  to  the  surface.  The  vector,  r^,  is  normal  to  the  ^  =  constant  curve,  and 
points  in  the  direction  of  increasing  Thus,  is  normal  to  a„.  is  normal  to  the  t)  =  constant 
curve,  and  points  in  the  direction  of  increasing  17.  Thus,  r,,  is  normal  to  a^.  The  interpolating 
vector-basis  functions  satisfy  -f,!  =  1,  at  the  ith  node,  and  vanish  at  the  other  nodes;  •fi2  =  1 
at  the  z’th  node,  and  vanishes  at  the  other  nodes. 

We  have  developed  a  method  of  handling  the  improper  integrals  that  occur  in  computing  the 
matrix  elements  of  the  boundary-integral  operators.  This  algorithm,  which  is  based  on  an  idea 
of  Hayami  [10],  requires  one  to  locate  the  field  point  that  is  nearest  the  source  region,  and  then 
translating  the  origin  of  coordinates  to  this  point.  The  computation  is  then  completed  in  a  polar 
coordinate  system,  which  has  the  effect  of  removing  the  singularity. 

SOLUTION  STRATEGIES 


The  discretized  equations  that  are  obtained  by  applying  the  method  of  moments  to  integral 
equations  involve  dense  matrices,  with  no  particular  mathematical  structure.  Hence,  it  is  important 
to  develop  efficient  algorithms  for  solving  the  discretized  equations.  We  used  the  Linpack  [16] 
routine  CGECO  to  compute  the  LU  factorization  of  the  system  matrix,  and  CGESL  to  solve  the 
resulting  system  of  equations.  If  the  problem  is  too  large  to  accomodate  the  LU  factorization,  we 
employ  the  conjugate  gradient  algorithm  that  is  used  in  VIC-3D©. 

In  order  to  understand  the  conjugate  gradient  algorithm,  we  write  the  system  to  be  solved  as 
the  operator  equation 

r  =  AX,  (15) 


where  Y  stands  for  the  array  of  applied  electric  and  magnetic  fields,  and  X  stands  for  the  unknowns 
algorithm  also  uses  the  adjoint  operator.  A*,  which  corresponds  to  the  conjugate 
transpose  of  a  matrix. 

The  conjugate  gradient  algorithm  starts  with  an  initial  guess,  Xq,  from  which  we  compute 
Rq  =  Y  ~  AXq,  Pi  =  Qq  —  A* Rq.  Then  for  k  =  1,2,...,  we  update  Xk  by  the  following  steps: 


Sk 

ttk 

Xk 

Rk 

Qk 

bk 


APk 

\\Qk-^\? 

Xk-\  +  O-kPk 

Rk-\  -  akSk 

A^Rk 
WQkf 
IIQA.-1IP 
Qk  +  bkPk  . 


(16) 


When  ||.Rt[|/||y|l  is  sufficiently  small,  Xk  is  accepted  as  the  solution  of  (15). 

The  quality  of  the  solutions  obtained  from  the  matrix  factorization  and  conjugate  gradient  are 
comparable  and  the  errors  in  the  unknown  currenst  J  are  quite  small.  In  a  typical  problem,  solution 
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Figure  4:  A  printed- circuit  coil. 


Figure  5:  A  split-core  differential  probe. 


by  LU  decomposition  required  about  13  min.  on  the  Sun  workstation,  while  the  conjugate  gradient 
solution  took  about  90  min  to  converge.  While  this  initially  makes  the  conjugate  gradient  approach 
look  less  attractive,  there  is  a  potentially  bright  future. 

The  discretized  boundary-integral  equations  do  not  possess  the  special  Toeplitz  and  Hankel 
structure  of  the  volume-integral  equation  that  is  solved  in  VIC-3D®,  so  it  appears  that  we  must 
compute  the  matrix-vector  products  in  the  usual  manner  when  applying  the  conjugate  gradient 
algorithm.  Rokhlin,  [14]  and  [15],  has  recently  developed  an  algorithm,  called  the  “fast  multipole 
method  (FMM),”  to  obviate  this  problem.  The  algorithm  is  based  on  the  expansion  of  the  full- 
space  Greens’s  function,  fr,  in  terms  of  multipoles,  i.e.,  products  of  spherical  Bessel  functions, 
spherical  Hankel  functions,  and  Legendre  polynomials.  In  its  simple  single-stage  form,  the  algorithm 
reduces  the  computational  complexity  of  a  matrix-vector  multiplication  from  O(iV^)  to 
where  N  is  the  number  of  unknowns.  Coifman,  et  al  [15]  indicate  how  the  algorithm  may  be  applied 
to  vector  problems  in  electromagnetics,  when  the  appropriate  integral  equation  is  the  electric  field 
integral  equation,  but  not  the  magnetic  field  integral  equation.  During  our  present  research  we  will 
apply  the  algorithm  to  our  system,  which  consists  of  both  equations. 

ANALYSIS  OF  PRINTED-CIRCUIT  AND  SPLIT-CORE  DIFFERENTIAL  PROBES 

The  model  will  include  the  incident  fields  of  (8)  and  (9)  that  are  produced  by  the  printed-circuit 
coil  of  Figure  4  and  the  split-core  differential  probe  of  Figure  5. 

Arrays  of  printed-circuit  coils  are  becoming  prominent  in  precision  measurements  of  complex 
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shapes,  such  as  dovetail  slots  in  aircraft  turbine  disks  [11].  A  significant  advantage  of  such  arrays 
is  that  they  are  fixed  in  space  on  the  part,  and  no  mechanical  scanning  is  required. 

The  split-core  differential  probe  is  also  used  in  studying  complex  shapes.  It  consists  of  two 
identical  ferrite  cores,  around  each  of  which  is  a  coil.  The  arrangement  produces  a  differential  signal, 
which  is  the  difference  of  the  signals  seen  by  each  of  the  two  parts  when  the  probe  is  scanned  over  a 
flaw.  The  differential  probe  is  sensitive  to  edges,  and  not  to  slowly  varying  background  conditions. 
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1  Introduction 

The  near-fields  of  microstrip  antennas  printed  on  a  single  layered  substrate  and  radiating  into 
free-space,  have  recently  been  computed  and  measured  by  Mosig  et  al  [1].  Their  ideas  have  been 
extended  [2]  to  deal  with  multi-layered  dielectrics  and  the  resulting  formulation  was  used  to  develop 
a  triangular  subdomain  moment  method  code  which  is  based  on  the  theory  of  Rao,  Wilton  and 
Glisson  [3]  but  incorporating  Mosig’s  [4]  multi-layered  Green’s  function.  The  code  is  efficient  and 
suitable  for  use  on  present  generation  personal  computers;  by  contrast  we  note  that  Mosig  et  al.  [1] 
used  a  4  processor  CRAY-2  computer.  The  new  code  was  tested  by  comparing  numerical  predictions 
with  measurements  made  for  a  planar  two-arm  Archimedes  spiral  antenna  [5,  6,  7,  8,  9],  embedded 
in  a  multi-layered  medium  and  fed  by  a  Marchand  balun  [10,  11].  The  antenna  was  designed  for 
operation  between  2  to  6  GHz.  The  self- complement  ary  spiral  pattern  was  etched  onto  a  thin  copper 
clad  dielectric  substrate  and  a  multilayered  microwave  absorber  was  placed  between  the  spiral  and 
the  conducting  ground  plane  to  suppress  interference  due  to  reflections.  To  stringently  test  the  code 
the  antenna  was  also  covered  with  a  multilayered  superstrate.  Unlike  Archimedes  spirztl  antennas 
for  avionic  applications,  which  are  usually  backed  by  a  closed  cavity  with  conducting  side  walls, 
the  test  antenna  was  not  cavity  backed.  Table  1  lists  some  of  the  antenna  parameters  and  the 
conductor  pattern  is  shown  in  Fig.  1. 


2  Computation  of  the  Radiated  Fields 

Using  the  notation  of  Rao,  Wilton,  and  Glisson  [3],  the  electric  and  magnetic  fields  are  computed 
at  an  observation  point  {4>,p,z)  due  to  a  source  triangle  supporting  a  certain  known  current 
distribution  as  shown  in  Fig.  2.  The  coordinates  x,y,z  are  global  coordinates,  and  X,Y,Z  are 
local  coordinates,  with  X  the  unit  vector  of  p+,  and  Z  parallel  to  z.  The  horizontal  electric  dipole 
(HED)  is  X  directed.  In  Section  2.9  [2,  pp.  11-14]  the  Green’s  functions  for  the  fields  (electric  and 
magnetic)  of  a  HED  buried  in  a  multi-layered  medium  were  computed  in  terms  of 

1  P  1,2,-),  I  P  1, 2,)  and  |  P  |,  2,) 

where  P  =  p  —  r  ,  a  =  Zp+,  (3  =  IF  —  a  and  r  =  ^ri  +  +  (1  —  17  —  Here  ri,  r2  and 

rs  are  the  vectors  to  the  vertexes  of  triangle  T+  in  the  global  coordinate  system. 
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Transform  these  Green’s  functions  from  the  local  to  the  global  coordinates  to  obtain 

=  cosaG^^-sinaG^^, 

G^-^  =  sinoG^^  +  cosaG^^, 

Q^X  ^  gZX 

Now  substitute  the  expressions  of  and  G^^  obtained  in  Section  2.9,  Eq.  2.46  [2,  pp. 

13]  to  get 

Glf^  =  cos  a  [cos^  ySEi  +  sin^  ^£2]  -  sin  q  [sin  13  cos  (3{Ei  —  E2)] 

—  cos,5cos(/?  +  q;)Ei  +  siny8sin(/5  +  a)E2.  (2) 

Similarly,  for  the  other  components 

Gf  =  cos/3sin(/3  +  a)Ei  -  sin/3cos(/l  +  a:)E2) 

G^  =  cos^/3, 

G^^  =  sin/3cos(/3  +  a)ifi  -  cos/?sin(/3  +  Q)fl'2, 

G^jf  =  sin  (3  sin(^  +  (y.)Hi  +  cos  (3  cos((3  +  a)H2, 

Gjf  =  sin/l/io-  (3) 

Here  Ei,  £2^  Eu  H2,  ho  are  Sommerfeld  integrals  [2,  Section  2.9]  which  must  be  evaluated  nu¬ 
merically  [2,  Chapter  3]. 


This  gives  the  total  Green’s  functions  for  an  HED.  The  total  fields  due  to  the  current  distribution 
are  found  by  integration  over  the  source  triangle,  thus 


(4) 


Now  substitute  the  current  in  terms  of  the  basis  function  given  by 


J  i!4  I  P+  I  p+ 


to  obtain 


{  H  }  ~  -Ptlptld^  dr] 

~  Jo  lo  4"  zG^;/|  I  p^  I  dr).  (5) 

In  the  last  step,  the  Green’s  functions  of  Eqs.  2  to  3  were  used,  since  X  =  p^.  Eq.  5  gives  the  E 
and  H  fields  radiated  by  triangle  T^.  Add  the  contribution  of  triangle  T~,  which  is  also  spanned 
by  the  nth  basis  function 


I  +  jyG-p;;lp-\d(d,,^,  (6) 

where  p~  is  directed  towards  the  free  vertex  of  triangle  T~ . 

The  total  fields  are  obtained  by  adding  the  contributions  of  all  the  basis  functions 

{ H } = s  ii'  r  ^  I  ^  II  r  ® 

where  N  is  the  total  number  of  basis  functions. 
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3  Spiral  Buried  in  Layered  Media 


Consider  Fig.  3  where  the  Archimedes  spiral  antenna  is  located  inside  a  multi-layered  medium 
consisting  of  free-space,  polystyrene  foam  spacers  (cr  =  1.1— jO. 00011)  and  a  three  layered  absorber 
(EC COSORB  AN  75,  GRACE  Electronic  Materials).  The  AN  75  absorber  consists  of  three  layers 
A,  B  and  C;  the  thickness  of  each  layer  is  given  in  Fig.  3.  The  dielectric  properties  of  the  layers 
were  measured  in  S-band  waveguide,  and  Table  2  gives  the  resulting  values  of  =  €^(1  —  j  tan 6). 
Note  that  the  upper  layers  A  and  B  in  Fig.  3  are  thinner  than  their  counterparts  at  the  bottom. 
The  reason  is  that  the  upper  AN  75  was  split  into  its  three  component  layers  A,  B  and  C,  when 
the  permittivity  was  measured,  and  the  machining  reduced  the  thickness  of  layers  A  and  B  in  the 
upper  AN  75. 

Fig.  4  gives  the  magnitude  of  Ey  at  4.03  GHz,  measured  and  computed  in  the  radiating  near-field 
along  the  y-axis,  at  2  =  50  mm  from  the  conductor  surface,  with  and  without  the  polystyrene  and 
AN  75  superstrates.  The  measurement  and  computational  grids  were  both  300  mm  x  300  mm  in 
steps  of  5  mm.  The  dimensions  of  the  polystyrene  and  AN  75  slabs  were  also  300  mm  x  300  mm. 

The  measured  and  computed  magnitude  patterns  compare  well.  The  narrowing  of  the  beam  due 
to  the  superstrate  is  a  notable  feature  which  is  of  practical  interest.  The  measured  and  computed 
attenuation,  when  the  polystyrene  and  AN  75  are  inserted  between  the  spiral  and  the  observation 
point,  are  -13.9  dB  and  -15.1  dB  respectively  which  is  considered  to  be  good  agreement. 

Fig.  5  gives  the  measured  and  computed  phase  difference  in  Ey  at  4.03  GHz,  with  and  without  the 
polystyrene  and  AN  75  slab  in  front  of  the  spiral.  Agreement  is  very  good  except  near  the  edges 
of  the  polystyrene  and  AN  75  slabs  where  it  is  thought  that  diffraction  may  become  significant,  as 
suggested  by  the  bottom  graph  of  Fig.  5. 


4  Execution  Speed 

It  took  3  min  17  sec  (on  a  6  Megaflops/sec  66  MHz  Pentium  computer)  to  compute  the  current 
distribution  and  the  far-field  parameters  of  the  spiral.  When  the  fields  were  computed  on  a  grid 
of  51  X  51  points,  it  required  an  additional  13  min.  The  spiral  was  discretized  into  524  triangles, 
resulting  in  523  basis  functions.  The  code  was  written  in  Fortran,  FTN77/486  version  2.73,  created 
at  Salford  Software  Ltd.  on  27  October  1993.  The  subroutines  used  to  calculate  the  Bessel  functions 
with  complex  arguments,  were  written  by  Donald  E.  Amos,  at  Sandia  National  Laboratories.  In 
the  moment  method  code,  most  of  the  memory  is  used  by  the  Z  matrix,  for  which  only  the  upper 
triangle  was  stored.  The  set  of  linear  equations  was  solved  using  two  of  LINPACK’s  algorithms; 
zspfa.f  and  zspl.f.  The  amount  of  memory  needed  to  store  the  complex  elements  of  the  Z  matrix, 

Ar2  _  /V  iV^ 

16  X  -f  A)  «  16  X  (-y)  Bytes  , 

where  N  is  the  total  number  of  basis  functions.  With  N  =  1414,  the  size  of  the  2  matrix  is 
16  Megabytes.  The  program  requires  a  small  amount  of  overhead  memory,  therefore  (on  a  16 
Megabytes  machine)  N  must  be  a  little  smaller  than  1414.  To  confirm  this,  the  Archimedes  spiral 
antenna  was  analyzed  by  increasing  the  number  of  triangles  to  Tn  =  1254,  resulting  in  A  =  1322 
basis  functions.  The  execution  time  (6  Megafiops/sec  66  MHz  Pentium  computer)  to  compute  the 
current  distribution  and  the  far-field  parameters  for  a  single  frequency  point,  was  15  min.  18  sec. 
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5  Conclusions 


An  efficient  method  of  moments  (MoM)  technique  has  been  developed  to  analyze  arbitrarily  shaped 
planar  antennas  which  are  buried  in  multi-layered  media.  The  total  fields  are  computed  with  the 
observation  point  located  inside  any  layer,  and  at  any  arbitrary  distance  within  that  layer.  The 
numerical  predictions  were  verified  by  measuring  the  amplitude  and  the  phase  of  the  near-fields  of 
an  Archimedes  spiral  antenna  and  some  of  those  results  have  been  presented  here.  Conventional 
far-field  pattern  measurements  also  show  good  agreement  with  predictions  [2].  Results  of  the  code 
were  also  compared,  with  good  agreement,  to  other  (albeit  less  powerful)  codes  [2]. 

Since  the  new  code  computes  the  total  fields,  including  the  near-fields,  of  planar  antennas  embedded 
in  multi-layered  media  it  may  be  used  to  study  the  properties  of  antennas  for  ground  penetrating 
radars,  low  profile  communication  systems,  and  microwave  hyperthermia. 
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Table  1:  Parameters  of  Archimedes  Spiral  Antenna 


Parameter 

Value 

Spiral  constant,  a 

Strip  width,  w 

Feed  wire  length,  Cgt 
Spiral  arm  length,  L 
Range  of  winding  angle,  (l>  ' 

1.1459  mm 

1.8  mm 

3  mm 

519.3  mm 

2.618  rad  <  .^  <  30.137  rad 

Figure  1;  Archimedes  spiral  conductor  pattern  showing  the  segmentation  used  for  the  trian¬ 
gular  moment  method  model. 


Figure  2:  Coordinate  system  with  source  triangle  and  observation  point  at  p,  z). 
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Table  2:  Measured  and  tan^  of  the  three  layers  forming  AN  75 


frequency  [GHz] 

1  A  1 

1  B 

c 

- 7— 

4 

4 

2.5000 

1.5977 

0.3957 

1.5783 

0.5318 

5.4342 

2.5347 

2.6700 

1.6333 

0.4203 

1.6208 

0.5476 

5.2044 

2.5811 

2.8400 

1.6091 

0.4391 

1.6003 

0.5612 

5.0594 

2.5515 

3.0100 

1.5908 

0.4292 

1.5870 

0.5443 

4.8974 

2.5247 

3.1800 

1.5859 

0.4182 

1.5860 

4.6838 

2.5241 
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Figure  3:  Archimedes  spiral  backed  by  AN  75  and  radiating  through  AN  75. 
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Introduction 

When  a  radar  illuminates  an  object,  the  complex  back  scattered  field  will  vary  rapidly  in  both 
frequency  and  orientation  angle.  Consequently,  high  sampling  rates  in  both  frequency  and  angle  are 
required.  The  product  of  this  dense  sampling  is  a  large  amount  of  data  that  must  be  stored  and 
processed. 

It  is  clearly  of  interest  to  reduce  the  vast  amount  of  data  required  while  still  preserving 
adequate  accuracy  of  images  that  can  be  produced.  A  reduced  data  set  allows  for  compactibility  and 
quick  reconstruction,  thus  enabling  real-time  processing.  A  compression  technique  has  been 
developed  [1,  2]  which  analyzes  the  scattering  centers  of  a  return  on  an  individual  basis  in  the 
frequency  domain  where  the  return  has  a  simple  and  smooth  shape.  Therefore,  if  these  scattering 
centers  are  extracted  and  their  individual  responses  found,  each  scattering  center  can  be  sampled  fewer 
times  due  to  its  smooth  shape  and  still  be  reconstructed  accurately.  Obviously,  with  fewer  samples 
required,  the  data  has  been  effectively  compressed.  In  fact,  the  technique  has  compressed  image  data 
by  a  factor  as  large  as  several  hundred,  depending  upon  the  specific  object. 

This  study  examines  the  many  tradeoffs  and  assumptions  of  the  compression  and 
reconstruction  algorithm  through  the  application  of  Response  Surface  Methodology  (RSM).  The  first 
goal  of  this  work  was  to  identify  which  variables,  parameters,  and  assumptions  most  affect 
reconstruction  accuracy.  The  second  goal  was  to  characterize  how  the  input  variables  relate  to 
reconstruction  accuracy.  RSM  also  affords  the  ability  to  predict  optimum  settings  for  the  variables 
without  the  need  for  years  of  experience.  Thus,  RSM  would  provide  less  experienced  operators  with  a 
systematic  method  of  choosing  variables  and  settings  to  attain  optimal  results. 

Background 

In  a  broad  sense,  the  compression  technique  identifies  scattering  centers,  uses  Fourier  analysis, 
and  is  iterative  in  nature.  The  first  method,  explored  by  Tseng  [1],  was  based  on  the  assumption  that  a 
object’s  scattered  field  can  be  viewed  as  the  superposition  of  various  scattering  centers’  responses. 
Tseng  could  then  transform  the  data  to  construct  an  image  of  the  object  and  show  spatial  locations  of 
the  scattering  centers.  These  scattering  centers  were  then  extracted  from  the  data  set  through  image 
gating  or  spectral  smoothing.  Tseng  showed  that  the  spectral  response  of  these  scattering  centers  was 
smooth  and  simple  with  respect  to  scanning  frequency  and  angle.  Thus,  sampling  this  spectrum  data 
could  greatly  reduce  the  required  number  of  samples,  and  hence,  reduce  data  storage. 

Further  work  by  Burnside  and  Chang  [2]  showed  the  assumption  that  a  complicated  object 
could  be  viewed  as  a  superposition  of  isolated  scattering  centers  was  insufficient  due  to  the  fact  that 
such  objects  exhibit  anisotropic  frequency  dependent  characteristics.  These  mechanisms  may  have 
very  sophisticated  phase  histories  due  to  the  object’s  complicated  structure.  Consequently,  the  image 
that  is  created  may  not  exhibit  isolated  peaks  spatially,  but  instead  may  show  clusters  of  peaks  that 
may  not  be  individually  extractable.  Therefore,  they  decided  to  treat  these  scattering  centers  as  either 
a  complex  point  scattering  mechanism  or  a  line  segment  scattering  mechanism.  With  these  definitions 
and  the  appropriate  algorithms  for  each,  one  can  follow  the  same  extraction  process  as  presented  by 
Tseng. 

The  focus  of  this  research  was  the  application  of  RSM  to  these  data  compression  and 
reconstruction  algorithms.  RSM  is  a  collection  of  statistical  tools  by  which  an  organized  series  of 


^This  work  was  supported  by  the  U.  S.  Air  Force  Wright  Laboratories. 
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experiments  can  be  executed  with  prescribed  procedures  for  response  analysis.  It  seeks  to 
mathematically  represent  the  connection  between  a  series  of- inputs  and  outputs.  Box  and  Draper  [3] 
more  formally  define  RSM  as  a  group  of  statistical  techniques  for  empirical  model  building  and 
exploitation.  By  careful  design  and  analysis  of  experiments,  it  seeks  to  relate  a  response,  or  output 
variable  to  the  levels  of  a  number  of  predictors,  or  input  variables.  According  to  Khuri  and  Cornell 
[4],  RSM  encompasses:  (1)  Designing  a  series  of  experiments  that  will  yield  adequate  and  reliable 
measurements  of  the  response(s)  of  interest  in  a  region  of  interest;  (2)  Analyzing  the  results  of  those 
experiments  to  determine  a  mathematical  model  that  best  fits  the  data  collected;  (3)  Searching  for  the 
optimal  settings  of  the  input  variables  that  produce  a  desired  (often  the  maximum)  response. 

RESULTS 

A  simplified  object  data  set  (line  scatterer)  was  selected  to  verify  the  technique  before 
considering  more  complicated  data  sets.  The  simulated  scattering  fields  of  the  line  scatterer  are  based 
upon  a  physical  optics  (PO)  approximation  and  simulate  the  co-  and  cross-  polarized  measured  fields 
as  if  measured  in  a  compact  range.  Figure  (1)  shows  two  images,  one  from  the  original  data  set  and 
one  generated  from  the  compressed  data  set. 

Based  on  Design  of  Experiments  criteria,  a  fractional  factorial  design  was  selected  to 
accomplish  the  data  runs  with  the  parameter  settings  shown  in  Table  (1)  with  the  resulting  output 
statistics  shown  in  Table  (2).  The  parameters  used  in  the  tables  are  explained  below.  All  of  the  output 
statistics  were  based  upon  a  normalized  multi-angle  correlation  metric  given  by 


Correlation  _  Metric  =  p 


1=1 


(1) 


that  was  calculated  on  a  point  by  point  basis.  The  original  (x)  and  reconstructed  (y)  values  are  the 
complex  electromagnetic  fields  at  a  given  angle  and  frequency.  This  metric  is  thus  a  measure  of  the 
error  between  the  fields  computed  with  the  original  data  and  those  computed  with  the  compressed 
data.  The  sums  are  performed  over  N  angles  at  a  single  frequency.  This  metric  was  chosen  due  to  the 
wide  use  of  correlation  in  other  research.  Due  to  non-normalization  with  respect  to  reconstruction 
energy,  values  of  correlation  may  be  above  and  below  one,  with  one  being  the  ideal. 

The  following  equations  were  used  in  calculating  the  correlation  error  metrics  as  shown  in 
Table  (2). 


Average  Correlation  = 


P  = 


n 

Mean  Square  Error  =  MSE  =  ^ 

f=i 


(p.-ir 


(2) 

(3) 


Root  Mean  Square  Error  =  Root  _  MSE  = 


Average  Absolute  Difference  =  Avg  _  Abs  _  Diff  = 


t 


(4) 

(5) 
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(6) 


These  metrics  seek  to  characterize  the  correlation  shown  in  Figure  (2)  and  ideally  drive  the  correlation 
plot  to  LOO  across  the  entire  angle  span.  Equations  (2),  (3),  (4),  and  (6)  were  examined  for  empirical 
model  building  appropriateness. 

Looking  at  the  Root  Mean  Square  Error  (Root  MSE)  as  the  primary  response  variable,  a  coded 
linear  equation  was  produced  via  a  least  square  analysis  given  by 


Y{x)  =  0.2291  +  0.0109xj  +  0.0490^^  +  0.0442x3  +  0.0288x,  -  0.031  Ixj  (8). 


The  analysis  of  variance  (ANOVA)  table  shown  in  Table  (3)  shows  a  large  value  of  0.8880  for  R^ 
signaling  a  high  accounting  of  the  model  variability  and  a  small  MSE  value  of  0.01.  Additionally,  F  = 
9.5156  indicates  a  good  fitting  model  even  though  it  is  acting  more  like  a  signal-to-noise  ratio  than  a 
true  F  statistic  (the  higher  the  better).  Gradient  points  were  generated  and  are  shown  in  Table  (4). 
Note  that  the  data  compression  and  reconstruction  process  is  deterministic,  i.e.  the  same  results  are 
obtained  with  every  repeated  run.  Additionally,  all  of  the  model  error  is  attributed  to  lack  of  fit  since 
the  compression  and  reconstruction  process  is  deterministic.  A  gradient  search  technique  was 
employed  to  reduce  the  Root  MSE  values.  Integer  multiples  were  calculated  and  new  runs  executed 
corresponding  to  Runs  13  to  17  in  Table  (2).  Scanning  down  the  associated  Root  MSE  column,  the 
numbers  appear  to  be  declining  with  each  run  as  would  be  expected.  Note,  however,  Run  16’s  value 
for  Root  MSE  is  greater  than  the  two  previous.  This  increase  in  response  indicates  curvature  in  the 
surface  and  is  the  beginning  of  “hill”  on  the  response  surface.  Run  17,  however,  shows  a  smaller  value 
for  Root  MSE,  and  in  fact  the  smallest  of  any  run  made  so  far.  Achieving  this  value  indicates  the  hill 
that  was  previously  encountered  is  actually  just  a  small  bump  on  the  surface.  The  gradient  technique 
as  performed  in  effect  passed  through  the  bump  and  continued  on  to  the  next  point.  Considering  all 
the  gradient  points,  the  technique  appears  to  be  tracking  the  surface  and  not  being  overly  influenced  by 
its  bumpy  surface.  If  that  is  indeed  the  case,  then  Root  MSE  does  pose  as  a  viable  response  predictor 
variable. 

Average  Correlation  was  examined  next;  a  coded  linear  equation  was  produced  via  a  least 
square  analysis  and  given  by 


Y{x)  =  0.9017  -  0.0019xj  -0.0402x3  -  0.0433x3  “  0.0194x4  -0.0203x5  (9). 


2 

ANOVA  values  are  shown  in  Table  (5).  From  the  ANOVA  table,  it  is  seen  that  the  R  value  has 
increased  from  the  0.8880  value  for  the  Root  MSE  measure  to  a  value  of  0.9499.  It  is  usually 
meaningless  to  compare  the  MSE  between  data  sets  because  MSE  only  has  meaning  in  the  context  of 
the  total  sum  of  squares  and  degrees  of  freedom  (accounted  for  in  the  F  ratio).  The  F  statistic  is  F  = 
22.7376  and  is  much  larger  than  Root  MSE.  Both  of  these  improvements  signal  an  even  better  fitting 
model.  Gradient  points  were  generated  and  are  shown  in  Table  (6).  These  points  correspond  to  Runs 
18  to  22  of  Table  (2).  Again,  by  examining  the  associated  values  for  Average  Correlation  for  these 
runs,  the  numbers  decrease  from  the  start  and  then  increase  to  a  fairly  high  value  of  average 
correlation.  The  same  phenomena  can  again  describe  the  response  for  Average  Correlation  where  the 
gradient  is  encountering  bumps  along  the  surface  but  is  continuing  to  show  improved  responses. 


Conclusion 

This  research  effort  used  data  compression  and  reconstruction  computer  codes  comprising  a 
compression  technique  in  which  individual  scattering  centers  are  analyzed  in  the  frequency  domain 
through  the  use  of  Fourier  techniques.  In  this  domain,  the  scattering  centers  possess  simple  and 
smooth  shapes.  TTieir  shapes  allow  for  broader  sampling  in  the  frequency  domain,  and  reduce  the 
storage  requirements.  In  the  reconstruction  process,  the  relatively  smooth  shapes  allow  for  simple 
interpolation  schemes  to  be  employed  to  recreate  the  frequency  response.  Examples  of  reconstructed 
data  for  a  simple  object  were  shown  and  compared  to  the  original  data.  Actual  examples  appear  in  [1] 
and  [2].  All  of  the  examples  show  a  favorable  comparison  between  the  original  and  reconstructed 
data. 

A  set  of  empirical  model  building  and  exploitation  techniques,  known  as  the  Response  Surface 
Methodology  (RSM),  was  applied  to  the  data  compression  and  reconstruction  algorithms.  By  careful 
design  and  analysis  of  experiments,  RSM  seeks  to  relate  a  response  (or  output  variable)  to  the  levels  of 
a  number  of  predictors  (or  input  variables)  that  affect  it.  A  set  of  five  variables  was  selected  as  the  key 
input  variables,  and  four  response  variables  were  considered  as  a  possible  means  of  measuring  the 
reconstruction  accuracy.  First-order  linear  empirical  models  were  developed  for  the  four  responses.  A 
gradient  search  was  conducted  to  identify  an  improved  set  of  inputs  for  what  was  deemed  the  top  two 
responses  which  were  Average  Correlation  and  Root  MSE.  Based  on  the  gradient  points,  the  fitted 
response  surface  appeared  to  exhibit  a  slight  irregularity  (bumpiness)  for  the  responses  considered. 

The  focus  of  this  effort  was  to  empirically  model  reconstruction  correlation,  in  a  fashion  that 
allows  for  input  and  output  variable  characterization.  One  would  like  to  be  able  to  drive  the  response 
(reconstruction  accuracy)  up  or  down  in  the  most  efficient  manner  given  a  particular  set  of  input 
variables.  Four  models  were  developed  that  showed,  via  ANOVA  statistics,  a  good  fitting  empirical 
model.  Two  of  the  four  (Average  Corr  and  Root  MSE),  were  exploited  via  a  gradient  search.  These 
results  showed  that  the  models  could  indeed  identify  the  trend  of  the  response  surface.  Slight 
irregularities  were  present,  however,  the  gradient  points  continued  to  improve  the  overall  response. 
Therefore,  based  on  the  results  attained  thus  far,  RSM  seems  to  provide  a  means  of  choosing  the  best 
input  variables  for  the  data  compression  and  reconstruction  algorithms  examined.  This  technique 
appears  to  provide  a  systematic  input  variable  selection  procedure  for  both  experts  and  non-experts  to 
effectively  utilize  when  compressing  RCS  data.  Hopefully,  continuing  efforts  with  this  topic  will 
indeed  produce  further  conclusive  results. 
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Table  (1 ):  Resolution  III  experimental  design  set-up:  2]f,~  Fractional  Factorial  Design 


RUN#  I 

1  AVGCORR 

TIME  DOMAIN  DATA 

1  MSE  !  RTMSE  |  AVGABSDIFF 

1  VARIANCE  1 

1  STND  DEV 

18 

0.9633 

0.0337 

0.1836 

19 

0.938 

0.0371 

0.1927 

20 

0.9266 

0.0303 

0.1739 

21 

0.9765 

0.0203 

0.1425 

22 

0.9829 

0.0092 

0.0962 

Table  (2):  Data  runs  for  experimental  design  set-up 
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Source 

Degrees  of  Freedom 

Sum  of  Squares 

Regression 

5 

0.0501 

0.0100 

Error 

6 

0.0063 

0.0011 

Total 

11 

0.0564 

R" 

F  Test  Statistic 

F  -Lack  of  Fit 

0.8880 

9.5156 

Inf 

1  SSPE 

SSLF 

SSPO 

MSPE  1 

0.0017 

Q  _ 1 

Table  (3):  ANOVA  table  for  Root  MSE 


1  Coded  Variable  Form  I 

X, 

Xa 

X3 

X4 

X5 

Center  Point 

wmam 

0 

0 

0 

0 

-0.619 

-0.5581 

-0.3635 

0.3929 

Point  1 

-0.1371 

-0,619 

-0,3635 

0.3929 

Point  2 

-0.2741 

-1.2381 

-0.7271 

0.7858 

Point  3 

-0.4112 

-1.8571 

-1.0906 

1.1787 

Point  4 

-0.5483 

-2.4762 

-2.2324 

-1.4541 

1.5716 

Point  5 

-0.6855 

-3.095 

-2.7905 

-1.8175 

1.9645 

Point  6 

-0.8226 

-3.714 

-3.3486 

-2.181 

2.3574 

Point? 

-0.9597 

-4.333 

-3.9067 

-2.5445 

■EESEB 

I  _ 

X1 

Xg 

X3 

X4 

Xs 

Center  Point 

3 

22 

-22 

Stec  Size 

-0.619 

-1.1162 

-0.7271 

Point  1 

0.2931 

2 

20.8838 

-21.2142 

Point  2 

0.2862 

2 

19.7676 

-20.4284 

Point  3 

0.2793 

2 

18.6514 

-24.1813 

-19.6426 

Point  4 

0.2724 

2 

17.5352 

-24.9084 

-18.8568 

Point  5 

0.2655 

2 

16.419 

-25.6355 

-18.071 

Point  6 

0.2586 

2 

■■eEuSI 

-17.2852 

Point  7 

0.2517 

2 

14.1866 

-27.0897 

-16.4994 

I  :  ^ 

Table  (4):  Gradient  point  runs  for  Root  MSE 


SPE  SSLF  "  SSPQ  I 

0  ""  "*  0.0018  0-0005  I  0 


Table  (5):  ANOVA  table  for  Average  Correlation 


Table  (6):  Gradient  point  runs  for  Avg  Corr 
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S  P  Walker,  M  J  Bluck,  M  D  Pocock,  C  Y  Leung,  S  J  Dodson 
Imperial  College,  London 


1.  Introduction 

The  integral  equation  approach  has  well  known  advantages,  including  the  need  for  discretisation  of  only  the 
surface,  and  the  satisfaction  of  boundary  conditions  at  infinity.  In  flie  time  domain,  it  has  the  additional 
advantage  of  not  requiring  the  solution  of  large  dense  matrices,  and  of  being  able  to  provide  results  at  a  many 
frequencies  from  a  single  computation^. 

Here  we  describe  the  implementation  of  a  time  domain  integral  equation  treatment,  employing  curvilinear, 
isoparametric  elements,  with  implicit  quadratic  modelling  of  the  temporal  variation.  This  has  the 
advantage  of  being  able  to  represent  arbitrarily  curved  surfaces  accurately,  avoiding  facetting,  and  to 
provide  a  similarly  efficient  representation  of  the  spatial  variation  of  the  fields.  In  sections  2  and  3  the  main 
features  of  the  approach  are  outlined,  with  a  discussion  of  its  successful  implementation  on  a  256  processor 
Cray  T3D  presented  in  section  4.  The  direct  coupling  to  industry  standard  CAD  software,  for  solid  modelling, 
meshing  and  post  processing  and  display  is  described  in  section  5.  Example  res  results,  including  the  8.4 
wavelength  NASA  almond^,  and  the  18  wavelength  NASA  cone-sphere  with  gap,  are  presented  in  section  6. 
Current  developments  are  noted  in  section  7. 

2Jjumerical  Treatment 

The  MFIE  for  scattering  from  a  perfect  conductor  can  be  written 

2;rH(r,f)  =  4;r  +  J(n'  x  H(r',i*))  x^-t-  |^n'  x  jx  (1) 

Here  is  the  surface  of  the  body,  r  is  the  point  on  the  surface  at  which  the  field  is  sought,  and  primes  denote 
other  points  on  the  surface,  a  distance  R  away,  where  the  unit  normal  is  n'.  The  time  is  t,  with  an  asterisk 
denotes  retarded  time. 

Geometry  representation 

To  evaluate  the  integral  we  peutition  the  surface  into  M  elements  m,  over  each  of  which  the  geometry  is 
represented  in  terms  of  intrinsic  co-ordinates  (see  figure  1); 

x‘  =  x'{^,Tl)  (2) 

The  actual  form  of  the  geometric  transformation  is 

=  a  =  l,..,a^  (3) 

where  the  summation  is  over  the  CX^  nodes  associated  with  a  particular  element,  the  x‘^  are  the  nodal 

locations,  and  the  are  shape  functions  associated  with  each  node.  A  Jacobian,  provides  the 

necessary  (position  dependant)  scaling  factor  between  areas  ds  in  real  space  and  areas  d^dri  in  intrinsic 
space. 
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The  full  ranee  of  element  types  available  in  (two  dimensional)  finite  elements  can  be  used.  We  have  thus  far 
taplSIJTed  slrirn^dXaMaterals  altd  6  noded  triangles.  In  each  of  these  ca«s  the  geometry  'stos 
allowed  to  vary  quadratically  between  nodal  locations.  -Ihe  particular  forms  of  the  shape  funchons  for  the« 
various  element  are  given  in  any  good  finite  element  lext3  U>cA  meA  refinement  to  model  parhcular 
features  of  the  geometry,  or  local  rapid  field  variations,  is  readily  performed. 

Field  representation 

Exactly  the  same  representation  is  adopted  for  the  spatial  variation  of  field  quantides;  the  time  dependant 
H  field  at  some  intrinsic  location  is  approximated  in  terms  of  the  nodal  field  component  values. 


(4) 


The  time  dependence  itself  is  modeUed  in  a  similar  fashion,  with  3  noded  quadratic  elements,  of  length  2  , 

with  associated  shape  functions  r^(T),  with  r  the  intrinsic  time.  Once  we  have  found  the  timesteps 
corresponding  to  the  temporal  nodes  p  of  the  time  element  in  which  the  retarded  bme  falls,  we  have 


^=1 


(5) 


Assembly  of  discretiscd  equations 

Inserling  all  the  above  into  (1),  we  can  write  the  integration  for  the  field  at  a  parHcular  timestep  for  each  of 
the  N  nodes  k  as  the  sum  of  integrations  over  M  elements: 


^  V  ^  J 

where  the  dot  denotes  the  time  derivative.  This  in  turn  can  be  written  as  a  matrix  equation 

2sH*  =  4irHL.,  +  XES[K]Hf 


m=!  a 


where 


[K]=  \ 


(6) 


(7) 


(8) 


in  which  the  matrix  A’  is  defined 

[Al  = 


-n^Ri  +  ”3^3  “^^3 

-n^Rt  + 

The  integrations  in  (8)  are  performed  using  Gaussian  quadrature;  typically  between  2x2  and  6x6  pomt 
Singular  and  Hypersingular  integrands 

The  integrand  in  (7)  is  seen  to  contain  both  l/R  and  1/R2  terms.  The  need  to  treat  these  d^s  not  arise  in  me 
more  usLl  flat-facetted  treatments;  the  n  x  H  term  is  then  necessarily  zero.  Proper  treatment  is  one  price 
which  must  be  paid  for  the  other  benefits  of  a  curvilinear  geometric  model. 
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The  1/R  term  is  of  the  kind  which  arises  also  in  elastostatics,  and  we  have  applied  essentially  the 
treatments  developed  there  Each  element  is  repartitioned  into  triangles,  with  the  location  of  the 
singularity  at  a  vertex.  Re-expression  in  polar  co-ordinates  removes  one  power  of  R„  rendering  the  integrand 
well  behaved. 

The  1/R  2  is  more  difficult.  It  is  accommodated  by  construction  of  a  local  'virtual'  element;  a  flat 
parallelogram  tangent  to  the  surface  at  the  location  of  the  singularity^.  The  tangent  element  allows  in  effect 
subtraction  of  the  hypersingular  term,  and  its  replacement  by  a  singular  contour  integral.  However,  since  this 
all  contours  meeting  at  titre  location  of  the  sin^arity  are  traversed  in  both  direcHons,  this  never  need  be 
evaluated.  (A  more  detailed  description,  and  investigation  of  the  significance  of  careful  treatment  of  such 
terms,  is  in  preparation.) 

Boundary  Conditions 

Equation  (6)  above  represents  3N  equations,  with  3N  terms  on  each  right  hand  side.  Imposing  perfect 
conductor  boundary  conditions  allows  both  the  number  of  equations,  and  the  right  hand  side  terms,  to  be 
reduced  to  2N. 

3.  Main  Solution  Stages  and  Cost  Scalings 

We  consider  below  the  relative  costs  and  costs  scaling  of  the  various  portions  of  the  analysis,  initially  for  a 
single  look  angle.  Figure  2  shows  the  relative  times  taken  for  the  various  stages,  on  a  series  of  increasingly 
finely  meshed  spheres. 

Matrix  formation 

Equation  (6)  shows  the  field  at  a  point  to  be  a  weighted  sum  of  the  historical  field  values  elsewhere  on  ^e 
surface,  with  the  weights  being  essentially  the  results  of  the  integrations  in  (8).  For  a  non-conducting 
dielectric  (vacuum),  the  Green  function  is  a  'pure'  delta  function,  so  that  only  the  field  at  exactly  the 
retarded  time  ago  is  of  relevance.  With  the  quadratic  modelling  of  the  temporal  variation,  typically  -5 
historical  values  are  required  to  approximate  this.  The  result  is  that  a  matrix  of  size  N  x  N  x  ~5  has  to  be 
formed.  Each  member  is  a  2x2  submatrix,  giving  an  overall  formation  and  storage  cost  of  ~20  N  .  Since  N  will 
vary  with  frequency  squared,  these  costs  both  scale  with  frequency  to  the  fourth  power. 

Matrix  solution 

The  right  hand  side  of  (7)  involves  historical  values  only  if  the  distance  to  the  nearest  integration  point  is 
larger  than  the  distance  light  travels  in  one  timestep;  otherwise  matrix  coefficients  multiply  as  yet  still 
unknown  field  values.  The  result  is  that  (7)  represents  an  implicit  matrix  equation,  rather  than  an  explicit 
formula  for  the  field.  The  matrix  is  very  sparse,  and  increasingly  so  as  larger  problems  are  addressed.  A 
matrix  equation  must  be  solved  at  each  timestep.  We  employ  an  iterative  (biconjugate  gradient,  or  CCS) 
method.  With  the  previous  timestep's  solution  as  the  initial  guess,  convergence  is  typically  obtained  in  a 
few,  or  at  most  a  couple  of  tens  of,  iterations.  It  is  found  that  the  solution  is  generally  a  small  and  declining 
fraction  of  the  total  work.  One  major  advantage  of  the  implicit  approach  is  that  a  stable  solution  is 
achieved.  Explicit  time  domain  integral  methods  have  been  found  to  suffer  from  instability;  we  can  induce 
instability,  by  modifying  timestep,  quadrature  order  and  mesh  size  to  approach  explicitness,  but  do  not 
observe  it  when  using  tire  codes  normally. 

Right  hand  .side  formation 

Equation  (7)  is  evaluated  at  each  timestep.  It  is  in  essence  a  large  matrix-vector  multiplication,  and  thus 
involves  summing  -20  products.  It  is  here  that  the  dominant  cost  of  tire  method  lies;  a  (frequency)  cost  at 
each  step,  with  -(frequency)  steps  required,  giving  an  overall  cost  scaling  of  (frequency)^. 

In-core  /  Out-of-cQK 

Once  the  matrix  size  exceeds  the  available  core  memory  it  must  either  be  read  from  disk  at  each  time  step,  or 
re-evaluated  as  required.  We  employ  botii  approaches;  for  a  single  processor  machine  tire  former  is  faster  by 
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evaluation  (or  reading)  rather  than  right  hand  side  formahon. 

A.  Parallelisation  Issues 

to  Sge  of  Ihe  surface  field  Wstery,  and  the  need  for  repeated  nght  hand  side  formation 

Matrix  formation 

for  the  time  domain  parallelises  equally  well. 

Right  hand  side  formation 

This,  the  dominant  cost  in  the  time  domain,  has  no  frequency  domain  counterpart. 

has  been  found  to  be  the  aliocation  of  portions  of  the  surface  (columns ,  in  ‘"or ^aWx 

processors.  Integration  over  its  portions  from  every  node  is  then  P^ormed  by  i„ 

formation  and,  mote  significantly,  for  tight  hand  side  formahon.  Tlus  inteerates  ovL  the  whole 

the  frequency  domain,  where  'nodes'  ate  allocated  to  processors,  and  each  processor  °y“ 

surface’  Thifapproach,  which  offers  considerable  gains  in  simpliaty,  requires  storage  of  the  cnhre  history  on 

each  processor. 

Achievement  of  opHmal  domain  decomposition  is  desirable  if  matrix  re-evaluation  is 

to  aUocate  equal,  Contiguous  portions  of  surface  to  each  processor,  shaped  to  mimmise  their  total  pen 

and  hence  the  influence  of  'halo  nodes'.  This  has  not  been  implemented. 

Matrix  solution 

As  is  seen  in  figure  2,  matrix  solution  is  a  tiny  part  of  the  total  cost,  and  accordingly  we  (yf*) 

^plemented  it  parallel.  As  will  be  seen  below,  the  very  good  parallelisation  the  method  ^ 

ItoLTrJStion  aCd  right  hand  side  formation  stages  does  increase  the  fraction  of  the  time  spent  on  solution 
quite  markedly,  and  this  is  now  an  issue  which  need  to  be  addressed. 

5.  Computational  Implementation 

The  code  was  designed  from  the  beginning  to  make  use  of  the  large  body  of 
generation  and  post-processing  and  display  software  available  comn^rcially. 
performed  in  a  solid  mCdeUing  package,  and  surface  meshes  generated 

my)  mesh  generation  software.  The  output  from  this  is  read  into  an  interface  f ^ 
input  file  for  Zeus.  Currently  interface  routines  for  the  packages  Patran  (Macneal  Schwendler)  and  Idea 

(SDRC)  exist. 

The  main  processing  in  Zeus  is  in  two  stages;  matrix  generation,  and  subsequent 

^eZTofZe'  cJs  discussed  below,  thfse  are  actually  combined).  The  output  of  tius  stage  s  a  ^sto^  o 
the  surface  currents.  If  the  res  is  required,  these  are  provided  to  an  res  routine  which  performs  a  Fourier 
Lisform  (or  equivalent).  If  the  currents  themselves  are  of  interest,  the  interface  process  is 
STreSe  provided  to'the  CAD  package  for  grapl^al  display.  This  can  be 

an  animated  display  of  the  evolution  of  the  fields.  The  option  also  exists  to  display  the  helds  space 

around  the  bodyffo^  example,  over  an  equatorial  plane.  Figure  3  shows  a  diagram  of  the  software  structur  . 
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6.  Examples  of  performance 

The  following  paragraphs  describe  a  few  examples  of  the  use  of  the  codes,  both  on  a  single  processor 
workstation  (DEC  Alpha  600, 256  Mb),  and  on  a  256  processor  Cray  T3D. 

Cylindrical  cavity 

Althou^  not  strictly  an  res  application,  the  cylindrical  cavity  in  figure  4  demonstrates  the  ability  to  predict 
time  dependent  surface  current  on  a  body  of  fairly  complex  shape.  Figure  5  shows  the  surface  current  induced 
on-axis,  inside  and  outside  the  rear  wall,  when  subject  to  an  incident  pulse  along  the  axis  into  the  open  end. 
The  pulse  width  was  about  one  cavity  length,  and  the  time  history  shown  extends  to  -5  transit  times. 

Almond;  3  GHz  (2.53  wavelengths,  1400  nodes) 

The  almond  (4154  node  mesh  shown  in  figure  6)  is  one  of  the  set  of  benchmark  targets  reported  by  Woo  .  A 
1202  node  mesh  was  used  to  generate  the  monostatic  res  shown  in  figure  7.  Also  indicated  are  measured  res 
values  (Woo),  and  values  calculated  by  Carlos^.  This  problem  was  run  in  core  on  the  DEC  workstation,  with 
the  90  look  angles  taking  a  total  of  9.3  epu  hours. 

Almond;  7  GHz  (5.89  wavelengths,  2450  nodes) 

This  will  not  run  in-core  on  the  serial  DEC  workstation;  with  matrix  re-evaluation  at  each  timestep,  one  look 
angle  took  ~30  hours;  subsequent  look  angles  would  take  ~3  hours  each.  Computed  and  measured  head-on  res 
were  both  -42dB  m^.  As  an  illustrahon  of  the  postprocessing  information  available,  figure  8  shows  the 
magnitude  of  surface  currents  induced  on  the  almond  (albeit  here  in  rather  less  informative  black  and  white 
than  the  colour  of  the  workstation.) 

As  an  illustration  of  the  effectiveness  of  parallelisation,  this  same  problem  was  run  twice  on  the  T3D  with  32 
and  64  processors.  For  a  single  look  angle,  tfie  total  times  were  807  and  409  seconds  respectively;  a  reduction  by 
a  factor  of  1.97,  indicating  hi^y  efficient  processor  utilisation. 

Cone-sphere  and  gap;  9  GHz  (18.15  wavelengths,  5924  nodes) 

Also  from  Wu,  this  18  wavelength  body,  showm  in  figure  9,  is  a  severe  test  for  the  2C)0Mb  DEC  workstation, 
and  only  a  single,  head-on,  illumination  was  run.  Matrix  re-evaluation  at  each  timestep  was  required,  taking 
one  hour  for  each  of  the  -250  timesteps  (-3  1/2  transits)  modelled,  a  total  time  of  -10  days.  Additional 
simultaneous  look  angles  would  take  rather  less  than  1  day  each. 

Extracting  a  0.869  GHz  result  gave  a  backscattered  res  result  of  -17.5  dB  m^,  the  same  as  the  body  of  revolution 
code  Cicero^.  A  frequency  of  9  GHz  corresponds  to  a  nodal  separation  of  -1/5  wavelength,  and  is  probably  at 
the  upper  end  of  the  frequency  for  which  the  mesh  is  adequate.  The  measured  and  computed  res  values  were 
both  — 6  dB  m^. 

Almond;  9.92  GHz  (8.36  wavelengflvs,  5090  nodes) 

Figure  10  shows  the  monostatic  res  of  the  almond,  illuminated  so  as  to  be  8.36  wavelengths  long,  with  some 
measured  values.  This  problem  was  run  on  the  T3D  with  256  processors.  Four  minutes  were  required  to  form  the 
matrix  the  once  required,  and  30  minutes  per  look  angle  for  right  hand  side  formation  and  solution. 

7.  Discussion  and  Conclusions 

A  curvilinear,  isoparametric  implementation  of  time  domain  integral  equation  methods  has  been 
demonstrated  to  be  an  effective  tool  for  EM  analysis.  Such  a  treatment  also  permits  the  important  practical 
advantage  of  direct  coupling  to  industry  standard,  commercial  CAD  and  pre-  and  post-processing  software. 

It  is  still  an  active  research  area.  The  principal  current  projects  include: 

-hybridisation  with  time  domain  optical  methods 
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-the  Inclusion  of  lossy  (RAM,  human  tissue)  target  materials 

-developments  which  promise  very  marked  reductions  in  both  storage  needs  and  operahons, 

S.  Adcnowtedgements 

The  developments  described  here  have  been  of  D^ence  Defence 

Macneal  Schwendler  Corporation  and  Thom  EMI  Electronics. 

Figure  captions 

Figure  1:  Mapping  of  general,  curved  surface  patches  into  flat  squares  in  intrinsic  space. 

Figure  2i  Relative  times  for  Ste  different  solution  stages  as  a  function  of  problem  size 
Figure  4:  Mesh  of  cylindrical  cavity 

Fieure  3:  Diagram  of  software  structure  . ,  ,  n 

Figure  5:  Magnitude  of  surface  current  induc^  on-axis  of  cylindncal  cavity,  inside  and  outsi  e 
when  subject  to  an  incident  pulse  along  tiie  axis  into  the  open  end.: 

Figure  6:  Mesh  of  NASA  almond;  4154  nodes 

Figure  7:  Computed  monostatic  (VV)  res  of  almond  at  3  GHz,  with  some  measured  and  Carlos  values  shown 
also. 

Figure  8:  Magnitude  of  induced  surface  currents  on  the  almond  at  7  GHz, 

Figure  9:  Mesh  of  NASA  cone-sphere  with  gap;  5924  nodes 

Figure  10:  Computed  monostatic  (W)  res  of  almond  at  9.92  GHz,  with  some  measured  values  shown  also. 
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DOUBLE  SCATTER  RADAR  CROSS  SECTIONS  FOR  TWO  DIMENSIONAL  RANDOM 
ROUGH  SURFACES  THAT  EXHIBIT  BACKSCATTER  ENHANCEMENT 
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In  this  work,  full  wave  solutions  for  the  single  and  double  scatter  radar  cross  sections  from  two  dimensional 
random  rough  surfaces  are  given.  The  solutions  are  expressed  as  multidimensional  integrals.  The  high  frequency 
approximations  are  used  to  reduce  the  double  scatter  integrals  from  twelve  to  four  dimensional.  The  single  scatter 
cross  section  is  given  in  closed  form.  The  large  radius  of  curvature  approximation  is  used.  The  incident  waves 
are  assumed  to  be  plane  waves. 

Similar  to  scattering  from  one  dimensional  random  rough  surfaces,  the  major  contributions  to  the  double 
scatter  cross  sections  come  from  two  different  pairs  of  paths.  They  are  the  quasi  parallel  double  scatter  paths  and  the 
quasi  antiparallel  double  scatter  paths.  The  total  incoherent  double  scatter  cross  section  is  the  sum  of  the  incoherent 
quasi  parallel  and  quasi  antiparallel  double  scatter  cross  sections. 

In  the  high  frequency  limit,  the  major  contributions  to  the  double  and  single  scatter  cross  sections  come 
from  the  neighborhood  of  the  specular  points  on  the  rough  surface.  Thus  the  surface  element  scattering  coefficients 
are  evaluated  at  the  specular  points  after  integrating  with  respect  to  the  slopes.  The  probability  density  functions 
of  the  slopes  are  assumed  to  be  Gaussian.  Shadow  functions  are  included  in  the  expressions. 

The  effects  of  changing  the  rough  surface  parameters,  such  as  mean  square  height  and  mean  square  slope, 
on  the  double  scatter  cross  sections  are  studied.  The  level  and  width  of  the  peak  in  the  backscatter  direction  depend 
on  the  mean  square  height  and  slope  of  the  rough  surface.  The  numerical  results  show  sharp  enhancement  in  the 
backscatter  directions.  This  sharp  backscatter  enhancement,  which  is  observed  for  all  polarizations  and  for  both 
normal  and  oblique  incident  angles,  is  associated  with  the  quasi  antiparaliel  double  scatter  paths. 


I.  FORMULATION  OF  THE  PROBLEM 


The  full  wave  solution  for  the  double  scatter  far  fields  G^(r)  from  two  dimensional  rough  surfaces 
{y  =  h(xs,Zs))  is  given  by  [1] 


^  (^)  ^  j  ’-^yexp  (jkon^  ■  f.a-)  exp  {jkoh'  ■  (f,2'  - 


Dr{n',n^)  ,  ^  un 


(1) 


dn'dni 


V{Tsv)V{rs2')dXsvdZsvdXs2'dZs2<G\^) 


in  which  the  time  harmonic  excitations  exp  (jwf)  are  assumed  and  the  free  space  wave  number  is 

The  incident  waves  are  in  the  direction  and  the  scattered  waves  are  in  the  direction  to  receive  at  f,  where 


(2a) 

fJ  =  nlcj:  4-  +  n{ez 

(2b) 

f  =  xlx  +  ysy  +  zez 

(2c) 
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The  scattering  matrices  at  points  1'  and  2'  on  the  rough  surface  are  and  D2' ,  ri') .  The  elements  of 

the  scattering  matrices  depend  on  the  local  slope  of  the  rough  surface  [1].  Moreover,  they  depend  on  the  polarization 
of  the  incident  and  scattered  waves  and  the  media  on  both  sides  of  the  rough  interface.  The  incident  fields  are 
assumed  to  be  plane  waves  and  the  receiver  is  located  in  the  far  field.  The  wavevectors  of  the  scattered  waves  at 

the  point  on  the  surface  at  rji-  are  in  the  direction  n'  =  +  rtyCy  +  n'C;.  (see  Fig.  1).  The  position  vectors 

to  points  V  and  2'  on  the  rough  surface  (see  Fig.  1)  are  given  by 

fsv  =  Xsvex  +  h{xsv,  Zsv)ey  +  Zsi'^z  (3a) 

rs2'  =  Xs2'ex  +  h{xs2>,  Zs2')ey  +  Zs2'ez  (3b) 


At  high  frequencies,  the  shadow  functions  U(fsi')  and  (7(fj2')  are  equal  to  one  if  the  point  at  fsi'  is  illuminated 
by  the  incident  waves  and  visible  at  point  2'  on  the  surface  and  if  the  point  at  rs2'  is  illuminated  by  a  point  source 
at  1'  and  visible  at  the  receiver  [2].  The  double  scatter  average  cross  section  is  obtained  by  multiplying  (1)  by  its 
complex  conjugate.  The  radar  cross  section  for  the  two  dimensional  rough  surface  is  defined  as 


in  which  A  is  the  radar  footprint,  thus 

J  (nl  -  nQ  {^4  ~  n")  (-n'  +  n'y)  (-n^  +  n") 

X  exp  {i/ro[wx(42  -  ^”2)  +  -  ^2")  +  n{{z,2'  -  ^s2")]  } 

X  exp  {~jkQ[4{xs2'  -  Xsv)  +  n'y{h2'  ~  fir)  +  n'^{zs2-  -  ^si')]  ) 

X  exp  {^jkQ\n'x{^s2"  —  3;si")  +  ^^(fiz"  ~  fii")  +  zi"[zs2"  —  } 

X  exp  {-ifioK(a;sr  -  +  n' (fii-  -  fir')  +  4{^sV  -  z^v)]  } 


h -n' 4  -n'4  Jl  ~  n'l  ^ 


dx  dz  dx  ^2'  dz^2'  dx  si"  dzgi"  dx  $2“  dzs2" 


For  the  quasi  parallel  double  scatter  paths  (n^,n"  <  0  and  >  0;  a  =  x.z),  the  following  transformations 

of  variables  are  used: 

=  isv  -  ar^r'i  0:^2  =  Xj2' -  a:s2";  (6a) 


Zdl  —  Zsi'  —  Zsl",  Z^2  =  Zs2'  —  252" 


Sal  =  (at^r  +  ^sl")/2,  Xa2  =  (Xs2'  +  X52")/2  (7a) 

Zal  =  {Zsl'  +  Zsi")/2,  Za2  =  (Zs2'  +  ^s2")/2  (7b) 
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Thus  from  (5),  (6)  and  (7),  one  gets  for  the  quasi  parallel  case 

ibg  r  D2'{n^,n')D2"in^  {n",n^) 

16^^®  J  (n(  -  n(,)  (4  -  n")  (-n*  +  n^)  {-n\  +  n'>) 

X  exp  [jko  [n{.Xd2  -  n^(0.5id2  -  0.5a;(ji  +  Xa2  -  a^oi)  +  n'^{~0-bXd2  +  0.5xdi  +  Xa2  ~  ^ai)  -  nlxdi] } 
X  exp  [nlzd2  -  n'^{^.^Zd2  -  O.S^di  +  Za2  -  ^al)  +  ^"(-0.52^2  +  O.Bzdl  +  Za2  -  Zal)-  n\zdl]  } 

X  exp  {jko[nl{h2'  -  h2")  -  ny{h2'  -  hi<)  +  riyiho't  -  /ii»)  -  n^(/ii'  -  /ii")]  } 


dn'’dn'l 


-dxdldzdldx  d2dzd2dx  aldZaidx  a2dza2 


(8) 


The  major  contributions  to  the  double  and  single  scatter  cross  sections,  in  the  high  frequency  limit,  come 
from  the  neighborhood  of  the  specular  points  of  the  rough  surface.  The  heights  at  any  two  neighbor  points  on  the 
two  dimensional  rough  surface  are  expanded  and  written  as  functions  of  the  heights  and  slopes  at  the  midpoint 
between  them.  The  heights  at  points  T  and  I"  on  the  rough  surface  are  expanded  about  the  heights  at  the  midpoint 
between  them.  The  statistical  average  of  the  radar  cross  section  (8)  with  respect  to  the  random  heights  and  slopes 
of  the  surface  is  taken.  Upon  changing  the  integration  variables  n'y ,  n",  n'. ,  and  n"  to  the  spherical  coordinate 
variables  t?'  and  d)'  one  gets 


PQ\ _  {2koLmy 
dp 


■P2{n')p2{n^) 

R,S=V,H 


/ 


[up  {nf ,  n')  d19  {ft',  n^) ,  n")  {ft",  n^) ' 

(ni  -  n;)  (nl  “  ^y)  (“4  +  ^y)  (“"y  + 


_ PjKcls,  hxc2s!  k^clsi  kzc2s) _ 

[l  -  j)]  [l  -  P2(|n"|)]  [fcoLmiK  -  <)]  sine  [koLm{n',  -  n'l)] 


X  exp  {~{h'^)kl{n'y  -  sirt’d' sin  ^"dd'dd" dtp' d<p" 


(9) 

The  integrations  with  respect  to  Xdi,  Xd2,  ^di,  and  Zd2,  yield  Dirac  delta  functions  in  the  high  frequency 
limit.  In  (9)  the  surface  element  scattering  coefficients  are  evaluated  at  the  slopes  at  the  specular  points 
kxcis,  hxc2s,  hzcis  and  as  a  result  of  integrating  the  Dirac  Delta  functions  (5{.)  with  respect  to  the  random 

slopes.  The  probability  density  functions  of  the  (large  scale)  slopes  pik^ci,  hxc2,  k~c2)  are  assumed  to  be 

Gaussian.  The  slopes  at  the  specular  points  are  given  by 


hxcls  — 


-<  +  - 1/  + 


n'y  +  n" 


hxc2s  = 

hzcls  = 


I 


+  2 


-n,  +  — ^ 


(10a) 

(10b) 


(lOc) 


207 


hzc2s  - - nl  + 


(lOd) 


n’  + 


+ 


n'y  +  n'l 

2 


The  integrations  over  Xd,  Xa2,  Zai  and  yield  the  footprint  area  A  and  the  sine  functions.  Furthermore,  Lm  is 
the  mean  width  of  a  typical  depression  on  the  rough  surface  [1],  [3],  [4],  The  probabilities  that  the  surface  does  not 
shadow  the  incident  and  scattered  waves  are  given  by  P2(h')  and  P2(S-^),  respectively  [2],  and  [1  -  P2]  is  associated 
with  the  probability  of  a  double  scatter  event.  The  symbols  V  and  H  are  for  vertical  and  horizontal  polarizations. 
For  the  quasi  antiparallel  double  scatter  paths,  the  following  transformations  of  variables  are  used  in  (5): 


—  Xs2"-, 

^d2  =  3:s2’  -  Xsl", 

(11a) 

^dl  =  ^sV  — 

Zd2  =  *s2'  —  *sl" 

(lib) 

Thus,  one  gets  the  following  expression  for  the  high  frequency  quasi  antiparallel  (n'^  <0,  Hq  >  0  and  > 
0,  <  0;  a  =  x,z)  double  scatter  cross  section 


PQ\  _  {2kaLmf 
dp  i  ~  jr 


R,s=v,H  [4-  (nl  -  n")  +  n'y)  (-n^  +  n") 

yjhxcls-:  hids:  Ilzc2s  ) 


((4  +  n'  -  n'>  -  n^)/2)“((n^  -  n'  +  n''  -  n‘)/2)' 

[1  -  Padn' I)]  [1  -  P2(|n"|)]  sine  [koLm{nl  +  n],  -  -  n")]  sine  [koLm{n{  +  n'  -  n'.  -  n")] 


X  exp  (^-{h')kl[n{j  -  n'y-  n'y  +n^)”^  sin  d' sin  d^' dip' dip"  j 
The  slopes  at  the  specular  points  for  the  quasi  antiparallel  case  are  given  by 

h^cu  =  -[-<  +  <-  <  +  71^]  /  -  n'y  +  n'y  -  n"] 

hxc2s  =  -  [4  -  <  +  K  -  <]  /  [ni  -n'y  +  n"  -  tiJ,] 
hzcis  =  -  [n/  -  n;  +  <  -  7i'']  /  ^nl  -  Tii,  +  n'y  -  n''] 
hzz2s  =  -[«{-<  +  <  -  n^j  /  -n'y  A  n'^  -  n^,] 


(12) 

(I3a) 

(13b) 

(13c) 

(13d) 


The  sharp  enhancement  in  the  backscatter  direction  {—n'  =  n^)  is  associated  with  the  quasi  antiparallel  {n'  ss  — n") 
double  scatter  cross  section  (12).  Note  the  difference  in  the  expressions  for  the  slopes  at  the  specular  points  for 
the  quasi  parallel  and  the  quasi  antiparallel  cases  (10)  and  (13).  For  backscatter  at  normal  incidence  the  major 
contributions  to  the  double  scatter  cross  sections  come  from  quasi  horizontal  paths  between  points  1  and  2  (see  Fig. 
1)  and  the  slopes  at  the  stationary  points  are  approximately  ±45°. 
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II.  NUMERICAL  EXAMPLES 


The  incoherent  double  scatter  cross  sections  (quasi  parallel  +  quasi  antiparailel)  for  two  dimensional  rough 
surfaces  are  plotted  in  Figs.  2,  3  and  4  as  functions  of  the  scatter  angle  cos  iff  (where  y)*  =  0,  =  0,  The 

two  dimensional  rough  surface  is  assumed  to  be  coated  with  gold  of  permittivity  Sj-  =  -9.888312  -  >1.051766  at 
A  =  0.633/im.  The  incident  angle  is  equal  to  10°.  The  Rayleigh  roughness  parameter  0  =  Ak^{h  )  is  assumed 
to  equal  394.105  in  Figs.  2  and  4.  In  Figs.  2  and  3,  the  VV  polarized  double  scatter  cross  sections  are  shown. 
The  effect  of  changing  the  mean  square  slope  (m.s.s.)  is  shown  in  Fig.  2  where  it  is  assumed  to  be  0.25,  0.4, 
0.5,  0.65,  0.75,  0.85  and  1.0.  The  level  of  the  peak  enhanced  backscatter  cross  section  is  highest  for  mean  square 
slope  equal  to  1.0  and  it  decreases  as  the  mean  square  slope  decreases  to  0.25.  In  Fig.  3,  the  mean  square  slope 
is  assumed  to  be  0.5  and  the  Rayleigh  roughness  parameter  is  assumed  to  be  10,  50,  100,  200,  300,  and  394.105. 
The  level  of  the  peak  enhanced  backscatter  increases  as  the  Rayleigh  roughness  parameter  increases.  The  width 
of  the  enhanced  backscatter  peak  increases  as  both  the  Rayleigh  roughness  parameter  and  the  mean  square  slope 
decrease.  The  cross  sections  for  different  polarizations  {VV,  HH,  VH,  HV)  are  shown  in  Fig.  4  in  which  the 
mean  square  slope  is  assumed  to  be  0.5  and  0  is  assumed  to  be  394.105.  Enhanced  backscatter  is  observed  for  the 
four  polarizations  considered.  The  levels  of  the  peak  double  scatter  cross  sections  (in  the  backward  direction)  are 
approximately  the  same  for  both  the  like  and  cross  polarized  cases.  The  sharp  backscatter  enhancement  observed 
for  all  polarizations,  is  associated  with  the  quasi  antiparallel  double  scatter  paths. 


CONCLUSIONS 


The  results  for  the  double  scatter  radar  cross  sections  exhibit  sharp  enhancement  in  the  backscatter  direction 
at  normal  and  oblique  incident  angles.  This  sharp  enhancement  is  associated  with  the  quasi  antiparailel  double  scatter 
path.  The  height  and  width  of  the  peak  in  the  backscatter  direction  depends  on  the  mean  square  height  and  slope 
of  the  two  dimensional  random  rough  surface.  The  high  frequency  approximations  make  the  computations  more 
tractable;  however,  the  polarization  dependence  is  less  obvious  [3]. 
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Double  scatter  only 


Fig.  2,  SMOO,  0r=-9.888312-jl.O51766,  ^.=0.633  p=394.105. 


Double  scatter  only 


Fig.  3, 9^10®,  £i.=-9.888312”jl.05l766,  X=0.633  fi.m.,  mean  square  slope=0.5 
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Abstract 

The  paper  compares  performance  of  two  public  domain  procedures  based  on  the  QR  and  Arnoldi 
methods  for  the  solution  of  dense  matrix  eigenvalue  problems  on  a  medium  cost  multiprocessor 
suuperscalar  SGI  PowerChallenge  architecture.  The  propagation  of  waves  in  a  rectangular  image 
guide  is  used  as  an  illustration.  It  is  found  that  not  only  is  the  Arnoldi  method  better  suited  for 
the  solution  of  typical  computational  electromagnetics  eigenproblems,  but  also  can  exploit  the 
computer  hardware  features  more  efficiently  than  the  QR  method, 

I.  Introduction 

One  of  the  current  trends  in  computer  hardware  are  the  new  medium  cost  multi  processor 
architectures  built  around  superscalar  RISC  processors.  It  has  been  observed  [4]  that  computer 
hardware  trends  will  have  significant  implications  on  how  the  the  computational  electromagnetics 
(CEM)  codes  are  developed.  It  is  important  then  to  reconsider  the  basic  CEM  problems  in  order 
to  find  algorithms  which  may  take  the  advantage  of  the  available  software  and  hardware  resources. 
The  most  important  issue  which  have  to  be  addressed  is  memory  bandwidth.  All  modern  medium 
cost  workstations  and  multiprocessor  systems  rely  on  the  hierarchical  memory  structure  with  the 
data  and  instruction  caches  integrated  on  the  chip  and  the  a  larger  secondary  cache  capable  of 
fetching  data  in  one  cycle.  This  hierarchical  memory  structure  can  be  used  to  obtain  significant 
speedup  on  electromagnetic  simulations  in  a  multiprocessor  system. 

II.  Numerical  algorithms 

Many  engineering  problems  require  the  solution  of  an  eigenvalue  equation.  For  instance,  in 
electromagnetics,  finding  modes  in  waveguides  or  resonators  is  equivalent  to  finding  eigenvalues 
and  eigenfunctions  of  a  differential  or  integral  operator  derived  from  Maxwell’s  equations.  The 
most  popular  way  to  solve  these  problems  is  to  approximate  the  operator  through  a  matrix  via 
some  sort  of  a  projection.  Different  projection  methods  generate  various  matrix  formats.  For 
instance  the  method  of  moments  with  entire  domain  basis  functions  yields  dense  matrices,  while 
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the  finite  difference  frequency  domain  method  (FDFD)  or  the  finite  element  method  both  generate 
sparse  matrices. 

In  the  simplest  case  the  matrix  eigenvalue  problem  to  be  solved  has  the  following  form 

Au.  =  Au  (1) 

where  ^  is  a  matrix  of  the  size  n  and  A  and  u  is  the  eigenvalue  and  eigenvector,  respectively. 
Usually  only  the  eigenvalues  (and  the  corresponding  eigenvectors)  located  near  the  upper  or  lower 
limit  of  the  operator  spectrum  are  of  the  practical  interest. 

Solving  eigenproblems  requires  the  use  of  appropriate  numerical  algorithms,  which  differ  one 
from  another  with  respect  to  the  numerical  complexity  and  the  memory  access  pattern.  In  the 
solution  of  a  specific  problem,  the  operator  matrix  might  be  symmetric  or  non-symmetric,  dense 
or  sparse.  The  non-symmetric  problems,  arising  in  the  investigation  of  modes  in  inhomogeneously 
loaded  waveguides  waveguides,  are  more  difficult  to  solve.  In  the  case  of  a  dense  non-symmetric 
matrix  the  traditional  approach  it  to  apply  the  numerical  algorithm  known  as  the  QR  method 
[1].  The  method  consists  in  reducing  the  matrix  to  the  Hessenberg  form  via  Householder  trans¬ 
formations  and  finding  the  Schur  form  in  a  series  of  QR  steps  [1].  Diagonal  elements  of  the  Schur 
matrix  are  equal  to  the  eigenvalues  A  of  ^ . 

This  method  has  a  few  disadvantages  and  the  most  important  of  them  are: 

•  The  high  numerical  cost  of  the  computation  of  the  order  O(iV^).  This  is  due  to  the  fact 
that  the  QR  calculates  of  all  the  eigenvalues  of  the  operator.  (Note  that  in  electromagnetic 
problems  usually  only  the  eigenvalues  located  near  the  upper  or  lower  limit  of  the  operator 
spectrum  are  to  be  found). 

•  Operation  on  dense  matrices,  which  makes  this  method  impractical  for  the  sparse  non-sym- 
metrical  problems. 

The  QR  method  is  implemented  in  several  libraries.  One  of  the  most  efficient  implementation 
is  the  DGEEV  routine  from  LA  PACK  public-domain  library  [6].  The  LAPACK  procedures  rely  on 
the  BLASS  operations  optimized  for  memory  access  pattern.  Hand  tuned  BLAS  routines  are 
also  often  provided  with  the  system  thereby  ensuring  the  most  optimal  performance  of  LAPACK 
routines  on  a  given  computer  architecture.  The  QR  method  is  easy  to  use  and  straightforward  in 
parameter  selection. 

Recently  an  alternate  approach  based  on  the  Arnoldi  [2][1]  method  has  attracted  the  attention 
of  numerical  algebra  experts.  The  particular  feature  of  the  Armoldi  method  is  that  it  enables 
calculation  of  selected  eigenvalues  (and  the  corresponding  eigenvectors),  and  does  not  require  the 
information  of  the  layout  of  individual  matrix  entries.  The  Arnoldi  method  is  based  on  the  concept 
of  Krylov  subspaces  Km 

Km.=s'^^ri{v^Av,A^v....,A^~^v]  (2) 

where  v  is  an  vector.  The  Arnoldi  method  constructs  a  non  square  Y^m  matrix  with  the  columns 
forming  an  orthonormal  basis  of  the  subspace  Km-  The  construction  of  the  Ym  does 

not  require  the  knowledge  of  A  and  relies  only  on  the  data  obtained  from  the  multiplication  of 
suitably  selected  vectors  with  A-  This  operation  can  be  done  very  fast  using  the  user  specified 
routines  (for  instance  with  the  GEMV  form  BLAS2).  The  desired  eigenvalues  and  eigenvectors 
of  matrix  A  are  then . recovered  based  on  the  information  obtained  in  the  construction  of  Ym- 
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The  size  of  is  n  x  m  where  n  is  the  size  of  the  eigenvalue  problem.  In  the  Arnold!  method, 
m,  the  size  of  the  Krylov  space,  is  usually  only  a  few  times  larger  than  the  number  of  desired 
eigenvalues.  This  implies  that  a  relatively  small  additional  memory  space  and  a  modest  number 
of  Au  operations  is  required  for  the  problems  in  which  only  a  few'  eigenvalues  are  of  interest. 

The  Arnoldi  method  can  be  implemented  in  many  ways.  In  our  w'ork  w'e  used  version  1.1  of 
the  DNAUPD  subroutine,  taken  from  the  one  of  the  NETLIB  packages  ARPACK  [5].  This  proce¬ 
dure  is  an  implementation  of  the  implicit  restarted  fc-step  Arnoldi  update  algorithm  [3].  It  is  a 
particularly  versatile  routine  due  to  reverse  communication  scheme  -  an  iterative  process  w'hich 
consists  in  performing  certain  operations  outside  the  subroutine.  In  case  of  the  Arnoldi  method, 
every  iteration  of  the  DNAUPD  requires  the  product  of  the  matrix  A  and  the  vector  v  provided  by 
the  procedure.  Matrix  A  is  not  passed  to  the  routine.  In  this  case  format  of  the  matrix  is  not  im¬ 
portant  for  the  procedure,  it  can  be  dense,  sparse  or  produced  “on-line”.  It  allow's  the  application 
of  fast  and  efficiently  optimized  mathematical  subroutines  intended  for  a  particular  format  of  the 
matrix. 


Ill,  Results 

We  compared  the  numerical  efficiency  of  the  Arnoldi  and  QR  methods  on  a  multiprocessor 
architecture  for  a  problem  of  finding  a  few'  low  order  modes  in  a  rectangular  waveguide  loaded  with 
a  dielectric  slab.  The  problem  is  converted  into  a  matrix  eigenvalue  equation  using  the  method 
of  moments.  The  resultant  matrix  is  dense  and  nonsymmetric.  The  eigenvalues  of  the  matrix  are 
then  solved  using  both  algorithms  on  a  multiprocessor  SGI  POWER  Challenge  system. 

Fig.  1  show's  the  total  computation  time  versus  the  problems  size  using  the  QR  algorithm 
for  the  increasing  number  of  processors.  Fig.  2  show's  the  speedup  due  to  the  multiprocessor 
execution.  The  speedup  is  defined  as  the  ratio  of  computation  time  for  n  >  1  processors  to  the 
execution  time  for  a  single  processor  system.  It  is  seen  that  the  QR  algorithm  does  not  scale  w'ell. 
The  speedup  for  4  processor  does  not  exceed  1.5.  and  the  speedup  does  not  change  much  with 
the  problem  size.  The  situation  is  drastically  different  for  the  Arnoldi  method  (Fig.  3).  For  this 
method  w'e  limited  the  number  of  eigenvalues  to  four,  corresponding  to  four  lowest  order  modes. 
As  might  be  expected  the  computation  time  is  much  shorter  than  for  the  QR  method.  However, 
the  most  interesting  properties  of  the  Arnoldi  method  show  up  when  one  considers  the  speedup 
curves  (Figs  4,5).  For  some  problem  sizes  the  speedup  exceeds  the  number  of  processors.  This 
super  linear  speedup  is  related  to  the  efficient  use  of  the  memory  bandwidth.  For  the  system  used 
in  testing  the  secondary  cache  size  per  CPU  was  1MB  w'hich  corresponds  to  the  dense  matrix  of 
the  size  N  =  370.  Increasing  the  number  of  processors  increases  the  fast  access  memory  available 
to  the  algorithm.  The  QR  algorithm  operates  on  individual  matrix  elements  and  has  to  hold  in 
the  memory  the  matrix,  its  Hessenberg  decomposition  and  then  its  reduced  form.  The  procedure 
runs  out  of  cache  w'hen  N  >  370  and  increasing  the  cache  by  adding  processors  has  little  effect 
since  data  has  to  be  moved  between  processors.  On  the  other  hand  the  Arnoldi  requires  little 
memory  in  addition  the  memory  required  for  storing  the  matrix  A  itself.  When  the  number  of 
processors  is  increased  the  matrix  is  distributed  over  caches  and  w'hat  is  even  more  important  the 
only  operation  on  the  matrix  elements  is  a  simple  matrix  times  vector  product.  This  operation, 
when  optimized  for  a  specific  multiprocessor  architecture,  does  not  generate  significant  amount 
of  data  traffic  between  processors.  As  a  result  the  Arnoldi  method  can  make  an  efficient  use  of 
the  increased  amount  of  secondary  level  cache  available  with  multiprocessor  execution.  Obviously, 


216 


when  the  matrix  data  becomes  too  large  or  can  be  held  within  a  single  processor  cache  the 
super  linear  speedup  disappears  since  the  access  time  is  comparable  for  single  and  multiprocessor 
execution.  This  effect  is  clearly  seen  in  Fig.  5  for  =  338  and  N  =  1682. 

It  is  now  interesting  to  compare  the  relative  performance  performance  of  the  two  algorithms. 
The  results  are  shown  in  Fig.  6  where  the  ratio  of  the  execution  times  of  the  Arnoldi  code  to 
the  QR  code  for  single  and  multiprocessor  case  is  plotted  versus  the  problem  size.  It  is  seen  that 
due  to  the  more  efficient  use  of  cache,  the  Arnoldi  method  parallelizes  much  better  than  the  QR 
method.  While  for  a  single  processor  case  the  Arnoldi  method  offers  up  to  24  fold  reduction  in 
the  CPU  time,  the  parallelization  of  both  algorithm  results  in  the  Arnoldi  code  giving  up  to  70 
times  shorter  execution  times  than  the  parallelized  version  of  the  QR  method. 

IV.  Conclusions 

The  Arnoldi  method  combined  with  the  reverse  communication  technique  is  a  very  powerful  new 
technique  much  more  suitable  to  the  solution  of  electromagnetic  eigenproblems  with  dense  matrices 
than  a  very  popular  QR  method.  It  enables  calculation  of  only  selected  eigenvalues  and  inherently 
preserves  the  locality  of  data  reference  which  implies  good  performance  on  the  multiprocessor 
super  scalar  architectures 
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Figure  1:  Total  time  spent  in  the  computation  of  the  eigenvalues  using  the  QR  method  for  various 
problem  sizes  and  various  number  of  processors.  The  parallel  version  of  the  DGEEV  (LAPACK) 
routine  from  complib .  sgimathjnp  has  been  used. 


Figure  2:  Speedup  of  the  computation  of  the  eigenvalues  using  the  QR  method  for  various  problem 
sizes  and  various  number  of  processors.  The  parallel  version  of  the  DGEEV  (LAPACK)  routine  from 
complib . sgimath_mp  has  been  used. 
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Figure  5:  Speedup  of  the  Arnoldi  algorithm  for  various  problem  sizes  and  various  number  of  pro¬ 
cessors.  The  super  linear  speedup  is  seen  for  N—882. 


Figure  6:  Speedup  of  the  Arnoldi  method  over  the  QR  method  for  various  problem  sizes  and  various 
number  of  processors. 
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Figure  3:  Time  required  for  finding  4  lowest  order  modes  with  the  Arnoldi  algorithm  for  various 
problem  sizes  and  various  number  of  processors.  The  parallel  version  o/DGEMV  (BLAS2)  routine 
from  complib .  sgimath_mp  has  been  used  for  matrix  x  vector  operation. 
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Figure  4:  Speedup  of  the  Arnoldi  algorithm  for  various  problem  sizes  and  various  number  of  pro¬ 
cessors.  The  parallel  version  o/DGEMV  (BLAS2)  routine  from  complib .  sgimathjnp  has  been  used. 
Note  the  strong  super  linear  speedup. 
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ABSTRACT 

The  hybrid  FDTD/FVTD  conformal  algorithm  is  implemented  on  a  massively  parallel 
computer  for  calculating  electromagnetic  scattering  of  smooth  surfaces  with  unstructured 
triangles.  Necessary  modifications  from  the  serial  code  to  the  parallel  code  to  improve 
computing  speed  and  reduce  memory  space  are  discussed.  The  parallel  FDTD/FVTD 
code  run  on  the  MASPAR  1 104  is  ten  times  faster  than  the  serial  code  run  on  the  Sparc 
10.  Numerical  simulation  of  the  parallel  code  is  validated  through  the  RCS  of  a  PEC 
sphere  by  comparing  with  the  Mie  series. 

INTRODUCTION 

The  hybrid  finite  difference  and  finite  volume  time  domain  (FDTD/FVTD)  algorithm 
have  been  implemented  on  both  serial  and  parallel  computers.  The  serial  code  have  been 
tested  intensively  on  both  SUN  and  IBM  (RS6000)  workstations.  The  development  of 
hybrid  FDTD/FVTD  techniques  can  be  found  in  papers[l'3].  In  this  paper,  we  discuss  the 
implementation  of  the  conformal  FDTD/FVTD  code  on  a  massively  parallel  machines  . 
The  parallel  code  accepts  unstructured  triangles  for  smooth  objects  the  same  as  the  serial 
code.  Due  to  different  computer  architectures,  some  subroutines  in  the  serial  code  need  to 
be  modified  to  fit  to  the  parallel  computation.  Our  effort  is  to  improve  the  computing 
speed  and  reduce  the  memory  space. 

The  parallel  computer  MASPAR  1104  has  4K  processors.  The  computer  consists  of  a 
front  end  machine  ,  the  scalar  unit  and  a  single  instruction  multiple  data  (SIMD)  parallel 
units  .  Scalars  are  stored  on  the  front  end  and  the  parallel  arrays  are  stored  on  the  parallel 
unit.  The  code  is  written  in  FORTRAN  and  the  variables  used  in  paraUel  FORTRAN  90 
constructs  are  stored  on  the  parallel  unit  and  mapped  across  the  processors. 
Communication  between  the  front  end  and  the  parallel  unit  is  very  time  consuming  and 
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must  be  minimized.  We  also  need  to  maximize  the  number  of  processors  computing  at 
any  one  time  and  minimize  router  communication  between  the  processors. 

The  massively  parallel  FDTD/FVTD  code  contains  two  parts;  A)  preprocessing  routines 
which  are  executed  only  once  before  the  FVTD/FDTD  calculation  for  inputting  the  grid 
point  data,  and  generating  the  tables  of  dependency  graph,  interpolation  coefficients  and 
parameters.  B)  time  domain  FDTD/FVTD  computing  routines  for  the  recursive  loops  of 
updating  E  and  H  fields  and  calculating  the  far  field  time  domain  electric  field  and  RCS. 
Due  to  the  complicated  relations  between  the  nodes  and  elements  in  the  unstructured  grid, 
all  the  calculations  which  involve  the  dependency  graph  between  nodes  and  elements 
must  be  done  in  the  preprocessing  routines  as  much  as  possible.  The  preprocessing 
routines  also  have  advantages  for  solving  scattering  problems  with  very  large  objects, 
since  the  same  pre-calculated  data  can  be  routinely  used  for  different  analysis  cases. 

A  brief  discussion  of  the  hybrid  FDTD/FVTD  algorithm  is  given  as  well  as  the 
interpolations  between  two  overlapping  grids,  the  far  field  extrapolation  and  the  boundary 
simulations.  Software  structure  of  the  parallel  FDTD/FVTD  code  and  the  necessary 
modifications  in  the  subroutines  so  that  it  can  take  advantage  of  parallelism  and  avoid  the 
computational  bottleneck  caused  by  the  communication  between  front  end  and  parallel 
unit  are  discussed.  The  performance  of  the  parallel  code  is  evaluated  by  comparing  with 
the  computing  time  of  the  serial  code.  Numerical  results  of  the  RCS  from  a  PEC  sphere 
obtained  by  the  parallel  code  are  compared  with  the  Mie  series. 

The  OVERLAPPING  FDTD/FVTD  HYBRID  Method 

The  development  of  the  hybrid  FDTD/FVTD  method  is  given  in  [2].  The  integral  forms 
of  Maxwell's  equations  that  are  used  in  the  time  domain  numerical  solutions  are  given  by: 

-  \  B» da  =[  E*di  Faraday's  Law  (line- surface  form)  (la) 

iA  SSa 

=  •  dT  Ampere's  Law  (line-surface  form)  (lb) 

-j^Bdv=jn^xEda  Faraday's  Law  (surface- volume  form)  (2a) 
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f  f  hx-Hda  Ampere's  Law  (surf ace- volume  form)  (2b) 

iv  J3V 


The  FDTD  (generalized)  algorithm  is  developed  from  the  discretization  of  Eqs.  1  and  the 
FVTD  algorithm  is  based  on  the  discretization  of  Eqs.  2.  For  more  details  on  these 
relations  and  the  derivations  of  their  discretized  forms,  please  see  the  references  [2]. 
Similar  to  the  original  [5],  the/  hybrid  is  also  time-domain  leap-frog  and  spatially 
staggered.  The  overlapping  grid  consists  of  a  rectangular  grid  and  a  body  conferral  grid. 
In  the  rectangular  grid,  we  assign  the  field  components  as  in  the  original  FDTD.  The 
conformal  grid  consists  of  several  layers  of  triangular  prisms  constructed  from  the  surface 
triangles.  The  electric  vectors  are  assigned  at  the  vertices  and  the  magnetic  vectors  at  the 
centroids  of  these  triangular  prisms.  The  outer  boundary  electric  vectors  on  the  FVTD 
primary  grids  are  obtained  through  interpolation  of  the  calculated  rectangular  FDTD 
electric  fields;  the  inner  boundary  of  the  rectangular  electric  fields  are  obtained  through 
interpolation  of  the  calculated  FVTD  electric  vectors.  For  the  electric  vertices  at  the 
boundary,  we  make  use  of  the  physical  boundary  condition;  for  the  outer  boundary  of  the 
rectangular  grid  we  apply  the  absorbing  boundary  condition.  The  near-to-far  field 
extrapolation  can  be  obtained  either  from  the  calculated  fields  on  the  rectangular  box  of 
the  FDTD  grid  or  from  the  fields  on  the  unstructured  triangles  of  the  FVTD  conformal 
grid. 

SOFTWARE  STRUCTURE  OF  THE  PARALLEL  FDTD/FVTD  CODE 

A)  Pre-Processing  program 

In  developing  the  parallel  FDTD/FVTD  code,  the  algorithm  must  be  mapped  to  the 
machine  to  take  advEintages  of  any  parallelism.  Unfortunately,  for  the  unstructured  grid, 
the  dependency  graph  between  the  nodes  and  elements  for  primary  grid  and  dual  grids  are 
too  complicated  to  fit  to  the  massively  parallel  architecture.  To  overcome  this  difficulty, 
we  separate  the  program  into  two  phases;  the  pre-processing  phase  and  the  routinely 
FDTD/FVTD  calculating  phase.  The  pre-processing  program  generates  tables  of 
dependency  graph  for  both  triangular  elements  and  the  vertices,  calculates  the  area 
normal  (area  vectors)  and  volumes  of  primary  and  dual  grids  and  generates  the  table  of 
interpolate  coefficients.  The  preprocessing  program  can  be  run  either  on  the  serial 
computer  or  the  parallel  computer.  The  data  files  generated  from  the  preprocessing 
program  are  then  read  by  the  FVTD/FDTD  calculating  program  to  do  the  routine 
computation.  Due  to  the  complicated  relations  in  the  unstructured  dependency  graph,  the 
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difficulty  in  obtaining  the  interpolation  coefficients,  and  since  these  routines  are  called 
only  once,  the  parallelization  of  the  preprocessing  subroutines  was  deemphasized,  and 
therefore,  the  computational  speed  of  the  parallel  version  of  preprocessing  program  is 
slower  than  the  serial  code.  In  solving  a  large  problem,  we  can  run  the  pre-processing 
program  on  a  serial  machine  and  then  dump  the  precalculated  data  files  to  the  well 
parallelized  FDTD/FVTD  code  to  do  the  routine  calculations.  The  precalculated  data  files 
can  also  be  reused  for  analyzing  different  cases  with  the  same  scattering  geometry. 

B)  The  parallel  FDTD/FVTD  calculating  program 

The  parallel  hybrid  FDTD/FVTD  is  based  on  the  serial  code  with  necessary 
modifications  to  improve  the  computational  speed  and  reduce  the  memory  space.  We  first 
eliminate  computational  bottlenecks  in  some  subroutines  to  improve  the  speed  and  limit 
the  array  dimensions  to  avoid  massive  memory  requirements  so  that  the  code  is  able  to 
solve  a  very  large  problem.  The  subroutines  of  the  parallel  FDTD/FVTD  code  and  major 
differences  from  the  serial  code  are  in  the  following: 

1)  In  subroutine  EXTRAP,  we  replace  the  near-to-far  field  extrapolation  from  the 
rectangular  box  in  FDTD  grid  by  the  extrapolation  from  the  triangles  of  the  conformal 
FVTD  grid.  The  far  field  extrapolation  subroutine  obtains  far  fields  from  the  calculated 
field  components  on  an  enclosed  surface  using  the  Stratton-Chu  formula.  For  a 
rectangular  box,  the  orientation  of  field  components  on  six  faces  are  all  different,  the 
bookkeeping  is  complicated  and  is  difficult  for  parallel  computing.  Instead  of  calculating 
the  far  field  from  the  rectangular  box,  we  obtain  the  extrapolated  far  field  from  the 
triangles.  Since  all  triangular  elements  are  on  the  same  zone  and  oriented  in  the  same 
direction,  the  bookkeeping  is  simple  and  can  be  easily  put  on  the  parallel  computer.  The 
other  important  modification  in  this  subroutine  for  the  parallel  version  is  that  a 
"Collisions  directive"  is  used  to  accumulate  arrays  with  duplicate  index  values.  This  new 
parallel  array  assignment  statement  allow  the  MASPAR  to  execute  the  vector-subscribed 
value  A(IEV(:N))=A(IEV(:N))+B(:N)  the  same  as  the  serial  DO  loop  even  when 
duplicate  index  values  are  in  array  rEV(:N).  With  this  new  "Collisions"  parallel  array 
assignment,  the  speed  can  be  improved.  Because  we  avoid  the  storage  of  the 
tremendously  whole  time  history  of  FDTD  field  components,  memory  space  can  be 
reduced. 

2)  In  absorbing  boundary,  we  replace  the  second  order  Mur  absorbing  boundary  by  our 
new  artificial  tapered  damping  radiation  condition[4]  in  subroutines  ABSORBS  and 
ABSORBH.  The  advantage  of  the  tapered  damping  absorbing  condition  is  that  it  is  very 
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simple  and  more  suitable  to  massively  parallel  computing  and  more  flexible  for  non- 
rectangular  grids. 

3)  In  subroutines  FVTDE  and  FVTDH,  we  vectorize  the  summation  of  H-vorticity 
(FVTDE)  and  E-vorticity  (FVTDH)  to  speed  up  the  computation.  Since  the  computation 
time  in  FVTDE  depends  on  the  maximum  number  of  triangles  around  a  vertex,  we  also 
use  our  newly  developed  unstructured  mesh  generator  to  create  more  uniform  surface 
grids.  The  speed  improves  as  the  maximum  number  of  triangles  around  a  vertex 
decreases.  In  Subroutine  FVTDINCB,  the  boundary  condition  simulation  is  performed. 
There  are  different  approximations  for  the  boundary  condition  as  discussed  in  [3],  here 
we  use  PECl  because  it  is  more  suitable  for  parallel  computing.  Due  to  the  complicated 
dependency  graph  of  the  unstructured  grid,  the  calculations  of  area  normal  of  the  facet  of 
dual  or  primary  cells  are  very  time  consuming.  In  order  to  improve  the  computing  speed, 
the  area  normal  vectors  are  pre-calculated  in  the  initial  step  of  each  subroutine  and  are 
stored  into  the  memory. 

4)  Subroutines  INTPCTR,  INTPRTCE  and  INTPRTCH  calculate  the  interpolated  data 
from  curvilinear  to  rectangular  grids  or  from  rectangular  to  curvilinear  grids.  Since  aU  the 
coefficients  are  pre-calculated  in  the  preprocessing  program,  the  parallelization  of  these 
subroutines  is  easy. 

EVALUATION  OF  THE  PARALLEL  CODE  AND  NUMERICAL  RESULTS 


The  numerical  simulation  is  carried  out  with  PEC  spheres  of  different  sizes.  Comparisons 
of  the  computing  time  between  the  parallel  code  on  the  MASPAR  1104  and  the  serial 
code  on  the  SUN  workstation  Sparc  10  and  Sparc  IPX  are  based  on  the  same  condition. 
The  radius  of  the  sphere  ranged  from  0.5  meters  to  4  meters.  The  grid  size  is  delx=0.1m. 
The  outer  boundary  of  the  FDTD  box  is  20  zones  from  the  surface  of  the  sphere.  The 
taper  damping  absorbing  boundary  is  10  zones  from  the  outer  FDTD  box.  The  number  of 
triangles  for  the  1  meter  sphere  is  2400,  for  the  2  meter  sphere  is  9600,  etc.  Figure  1 
shows  comparisons  of  computing  time  for  each  iteration  on  serial  and  parallel  machines. 
Table  1  is  the  distribution  of  timing  of  each  subroutine  in  the  parallel  code  on  the 
MASPAR  1104  for  a  sphere  with  radius  Im  and  2m.  Table  2  is  the  distribution  of  timing 
of  each  subroutine  for  a  sphere  with  radius  Im  and  4m  run  on  MASPAR  1104  and  Sparc 
10.  Figure  2  is  the  RCS  of  a  1  meter  sphere  comparing  with  the  Mie  series.  Note  the 
improved  accuracy  over  classical  "staircasing"  FDTD  techniques. 
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Figure  I.  Computing  time  pre  iteration  vs,  sphere  radius. 
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Figure  2.  RCS  of  a  1  meter  sphere  comparing  with  Mie  series. 
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Abstract 

CARLOS-3D  is  a  three-dimensional  scattering  code  which  was  developed  under  the 
sponsorship  of  the  Electromagnetic  Code  Consortium  (EMCC),  and  is  currently  used  by 
over  80  aerospace  companies  and  government  agencies.  The  code  has  been  extensively 
validated  and  runs  on  both  serial  workstations  and  parallel  super  computers  such  as  the 
Intel  Paragon.  CARLOS-3D  is  a  three-dimensional  surface  integral  equation  scattering 
code  based  on  a  Galerkin  method  of  moments  formulation  employing  Rao-Wilton-Glisson 
roof-top  basis  for  triangular  faceted  surfaces.  Fully  arbitrary  3D  geometries  composed  of 
multiple  conducting  and  homogeneous  bulk  dielectric  materials  can  be  modeled.  Thin 
materials  can  be  modeled  as  either  bulk  regions  or  using  various  approximate  boundary 
conditions  such  as  impedance  surfaces  or  resistive  and  magnetically  conducting  sheets. 
Matrix  symmetry  and  geometric  symmetry  are  used  extensively  in  the  code  in  order  to 
reduce  the  computational  and  memory  requirements  for  large  scatterers.  A  unique  feature 
of  the  code  is  the  Galerkin  operator  structure  of  the  code  which  is  based  on  an  indexing 
scheme  which  relates  basis  functions  to  unknown  numbers  in  the  matrix  equation.  All  of 
the  integral  equation  formulations  are  implemented  using  this  structure,  and  are  based  on 
six  Galerkin  matrix  operators.  This  operator  based  structure  is  independent  of  the 
geometry  representation  and  basis  functions  used. 

In  this  presentation,  we  will  describe  some  of  the  extensions  to  the  CARLOS-3D  code, 
and  how  the  operator  structure  of  the  code  facilitated  these  improvements.  Body  of 
revolution  (BOR)  and  two-dimensional  geometries  were  incorporated  by  simply  including 
new  input  routines,  and  the  appropriate  Galerkin  matrix  operator  routines.  Some 
additional  modifications  were  required  in  the  combined  field  integral  equation  (CFIE) 
matrix  generation  routine  due  to  the  symmetric  nature  of  the  BOR  and  2D  operators. 
Quadrilateral  patched  surfaces  with  linear  roof-top  basis  functions  were  also  implemented 
in  the  same  manner.  Quadrilateral  facets  and  triangular  facets  can  be  used  in  combination 
to  more  efficiently  model  geometries  with  both  large  smooth  surfaces  and  surfaces  with 
fine  detail  such  as  gaps  and  cracks.  Since  the  parallel  implementation  in  CARLOS-3D  is 
at  high  level,  these  changes  were  independent  of  the  computer  platform  being  used.  This 
approach  minimizes  code  maintenance,  while  providing  capabilities  with  little  additional 
effort. 

Results  will  be  presented  showing  the  performance  and  accuracy  of  the  code  for  some 
large  scattering  problems.  Comparisons  between  triangular  faceted  and  quadrilateral 
faceted  geometry  representations  will  be  shown  for  some  complex  scatterers. 
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Introduction 

CARLOS-3D  is  a  general-purpose  method  of  moments  (MM)  code  [1]  for  computing  the 
scattering  from  complex  three-dimensional  scatterers.  It  is  based  on  a  McDonnell 
Douglas  Aerospace  proprietary  code  CARLX)S,  which  models  antenna  and  scattering 
problems  for  2D  and  3D  geometries.  CARLOS-3D  was  developed  under  the  sponsorship 
of  the  Electromagnetics  Code  Consortium  (EMCC),  and  is  available  to  qualified 
organizations  through  them,  subject  to  US  export  control  laws.  The  code  uses  the  MM 
tedinique,  with  Galerldn  testing  to  solve  the  Stratton-Chu  surface  integral  equations  for  a 
user  specified  geometry.  All  of  the  surfaces  describing  the  scatterer,  consisting  of 
conducting  surfaces  and  boundaries  between  different  dielectric  regions  are  replaced  with 
equivalent  electric  (J)  and  magnetic  (M)  currents.  The  code  solves  for  these  induced 
equivalent  currents,  which  are  then  used  to  compute  the  scattered  far-fields. 


Code  Description 

Arbitrary  3D  surfaces  are  modeled  using  flat  triangular  facets,  with  the  electric  and 
magnetic  currents  expanded  in  terms  of  the  Rao,  Wilton,  Glisson  roof-top  functions  [2]. 
A  roof-top  function  spans  the  two  facets  forming  each  interior  edge  of  a  surface.  At 
junction  edges,  which  are  formed  by  the  intersection  of  two  or  more  surfaces,  half  roof¬ 
top  expansion  functions  are  used  to  expand  the  currents.  Current  continuity  across  a 
junction  is  enforced  by  equating  the  unknowns  associated  with  each  of  the  half  roof-top 
functions  for  the  edge.  Junction  edges  are  determined  automatically  for  surfaces  which 
have  common  node  points  along  a  line  of  intersection  between  surfaces. 

Complex  geometries,  which  are  composed  of  multiple  conducting  and  bulk  dielectric 
regions  can  be  modeled.  Various  boundary  conditions  can  be  imposed  separately  on  each 
of  the  surfaces  comprising  the  target.  For  a  given  geometry,  each  dielectric  region  is 
given  a  number,  and  every  surface  which  forms  a  boundary  between  different  regions  must 
be  entered  as  a  faceted  surface.  Infinitesimally-thin  conducting,  resistive,  and  impedance 
sheets  must  also  be  defined.  The  user  must  specify  both  the  interior  and  exterior  regions 
for  each  surface,  along  with  the  boundary  condition  to  impose.  Resistive  and 
magnetically-conducting  boundaries  can  either  be  embedded  in  a  region,  or  be  defined  on 
the  interface  between  different  regions.  Tapered  resistive  and  impedance  surfaces  are 
modeled  by  specifying  values  for  each  facet  forming  the  surface.  Impedance  (Leontovich) 
boundary  conditions  are  modeled  as  an  equivalent  combination  resistive/magnetically- 
conducting  boundary. 

The  Stratton-Chu  integral  equations  can  be  solved  using  several  different  formulations. 
For  conducting  surfaces,  either  the  electric  field  integral  equation  (EFIE),  the  magnetic 
field  integral  equation  (MFIE),  or  the  combined  field  integral  equation  (CFIE)  can  be 
used.  The  coupling  parameter  in  the  CFIE  formulation  is  specified  separately  on  each 
surface,  allowing  the  formulation  to  be  used  for  geometries  with  both  open  and  closed 
surfaces.  For  dielectric  boundaries,  the  PMCHW  formulation  (after  Poggio,  Miller,  Chu, 
Harrington,  and  Wu)  is  used.  Galerkin  testing,  in  conjunction  with  the  form  of  the  current 
expansions,  results  in  a  symmetric  system  of  equations  whenever  the  PMCHW/EFIE 
formulation  is  used.  In  addition,  the  formulations  implemented  for  treated  surfaces  also 


229 


result  in  symmetric  matrices  which  require  only  half  of  the  matrix  elements  to  be  computed 
and  stored. 

Scattering  from  apertures,  cavities,  and  gaps  in  a  conducting  surface  can  be  modeled  using 
an  infinite  ground  plane  option.  Image  theory  is  used  to  model  both  the  source  and  the 
induced  currents  on  all  of  the  surfaces  which  are  either  conducting,  dielectric  or  treated. 
The  resulting  system  of  equations  is  again  symmetric. 

Code  Structure 

The  CARLOS-3D  code  has  a  flexible  and  modular  structure  which  facilitates  the 
incorporation  of  new  features.  The  major  components  of  the  code  are  independent  of  the 
surface  representation  (i.e.,  flat  facets,  quadrilaterals,  curved  surfaces,  etc.),  and  of  the 
basis  functions  which  are  used  to  approximate  the  surface  currents,  permitting  the  code  to 
be  easily  adapted  to  advanced  basis  functions.  The  section  of  code  which  generates  the 
system  matrix  for  an  arbitrary  geometry  is  written  in  terms  of  a  generalized  Galerkin 
matrix  operator  notation  [3].  These  operators  result  from  testing  either  the  integral 
operators  in  the  Stratton-Chu  equations,  the  equivalent  currents  directly,  or  the  incident 
fields.  The  subroutines  which  assemble  the  system  matrix  and  right-hand-side  vector  refer 
to  these  generic  operators.  The  generic  operators  then  reference  routines  which  are 
specifically  written  for  a  given  basis  function  type.  Only  the  geometry  input  routine,  and 
these  specialized  Galerkin  operator  routines  depend  upon  the  surface  representation  and 
basis  functions  used.  Symmetry  relations,  which  can  be  established  for  the  Galerkin  matrix 
operators,  are  used  to  efficiently  fill  the  system  matrix  by  eliminating  redundant 
calculations. 

A  systematic  approach  [4]  is  used  to  generate  the  matrix  equation,  ZI^V,  for  an  arbitrary 
geometry  with  various  boundary  conditions  imposed  on  the  surfaces.  This  approach  is 
based  on  a  simple  indexing  scheme,  which  assigns  an  index  number  to  each  edge  (roof-top 
or  half  roof-top  function)  defining  the  entire  geometry.  The  index  number  for  a  particular 
edge  specifies  the  location  within  the  column  vector,  I,  of  either  the  J  or  M  current 
coefficient  associated  with  that  edge.  This  indexing  is  performed  in  the  geometry  input 
routine,  and  is  based  on  the  boundary  condition  which  is  imposed  on  each  surface.  The 
boundary  condition  is  used  to  define  the  equivalent  currents  which  reside  on  the  surface, 
and  the  relationship  between  the  interior  and  the  exterior  current  coefficients.  Current 
continuity  across  junction  edges  connecting  separate  surfaces  is  enforced  by  equating  the 
indices  associated  with  the  half  roof-top  functions  for  the  edge.  The  matrix  assembly 
routine  is  based  on  the  index  associated  with  each  basis  function  (edge),  and  not  on  the 
explicit  form  of  the  basis  function  or  the  surface  representation. 

Quadrilateral  Patch  Implementation 

A  quadrilateral  patch  formulation  [5]  was  incorporated  into  the  parallel  version  of  the 
code.  In  the  quad-patch  formulation,  the  currents  on  each  patch  are  represented  in  terms 
of  local  parametric  coordinates.  For  large  smooth  surfaces,  the  parametric  coordinate 
directions  are  nearly  orthogonal  and  the  formulation  has  been  shown  to  require  fewer 
unknowns.  This  formulation  employs  a  quadrilateral  patched  mesh  to  represent  the 
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surface  geometry  and  uses  linear  roof-top  basis  functions  which  span  the  pair  of  quad 
patches  which  form  each  interior  edge  of  the  grid.  These  basis  function  are  edge  based, 
and  are  implemented  in  manner  similar  to  the  Rao-Wilton-Glisson  (RWG)  roof-top 
functions.  In  this  implementation,  each  quad  patch  is  represented  by  four  points  on  the 
surface  of  the  scatterer,  and  a  bi-linear  surface  description  over  the  quad  is  assumed.  The 
matrix  elements  can  be  computed  in  a  manner  analogous  to  the  RWG  basis  functions  using 
a  combination  of  analytic  and  numerical  procedures  to  compute  the  self  and  near-self 
terms. 

The  modifications  to  the  parallel  code  were  straightforward.  The  input  routine  was 
modified  to  accept  either  triangular  or  quad  patches,  and  perform  the  unknown  counting 
and  indexing.  Galerkin  operator  routines  were  written  for  the  linear  basis  functions  on 
quad  patches  using  the  triangular  facet  routines  as  templates.  The  high  level  operator 
routines  were  then  modified  to  call  the  new  operator  routines  for  the  quad  patched 
surfaces. 

Hybrid  Combinations 

Hybrid  combinations  of  triangular  and  quad  meshed  surfaces  have  also  been  implemented 
into  the  code.  The  half  roof-top  functions  for  triangular  and  quadrilateral  patches  can  be 
combined  for  edges  which  are  formed  by  adjacent  triangles  and  quads.  Current  continuity 
normal  to  a  hybrid  edge  is  maintained,  and  the  representation  is  free  of  line  charges.  Note 
that  the  RWG  half  roof-top  function  is  different  than  the  half  roof-top  function  which 
occurs  for  a  degenerate  quad  patch  where  two  points  are  identical.  The  RWG  basis 
functions  appear  to  be  superior  for  this  degenerate  situation.  A  combination  of  triangular 
meshed  and  quad  meshed  surfaces  is  an  efficient  way  to  model  geometries  which  have 
both  surfaces  with  fine  detail  and  large  smooth  surfaces.  Galerkin  operator  routines  were 
written  to  compute  the  interactions  between  triangular  and  quad  surfaces. 

BOR  and  2D  Implementation 

The  subroutines  contained  in  the  CICERO  code  [6]  for  both  body  of  revolution  (BOR) 
and  two-dimensional  (2D)  geometries  were  incorporated  into  the  parallel  CARLOS-3D 
code.  This  was  a  straightforward  way  of  implementing  a  parallel  version  of  a  BOR  code 
by  taking  advantage  of  the  operator  structure  and  unknown  indexing  of  CARLOS-3D. 
Tliis  permitted  the  modeling  of  some  large  missile  shapes  using  an  efficient  formulation  for 
BORs.  In  addition,  some  of  the  formulations  and  features  of  CARLOS-3D  which  were 
not  contained  in  the  CICERO  code  were  available  with  little  additional  effort. 

The  input  routines  for  BOR  and  2D  geometries  were  incorporated  into  the  code  with  only 
minor  modifications.  The  Galerkin  operator  routines  were  then  integrated  into  the  code 
with  some  additions  added  relating  to  the  block  matrix  fill  used  in  CARLOS-3D  for  the 
parallel  implementation.  The  generic  matrix  assembly  routines  were  then  modified  to  call 
these  new  routines  for  BOR  of  2D  surfaces.  Some  additional  changes  were  made  to  take 
advantage  of  the  matrix  symmetry  which  results  from  the  BOR  and  2D  formulations  due 
to  the  orthogonal  components  of  the  basis  frinctions. 
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Results 

The  results  for  a  number  of  selected  problems  will  now  be  shown.  All  of  the  results  were 
generated  on  the  Intel  Paragon  at  Sandia  National  Laboratories.  The  first  problem  to  be 
shown  is  the  scattering  of  the  upper  stages  of  a  STRYPI  missile  at  1  GHz  using  the 
triangular  patch  representation  and  the  BOR  representation  of  the  missile.  The  upper 
stages  of  this  missile  is  5.7m  long  with  a  circular  cross  section  of  radius  0.39m.  The 
triangular  patch  model  was  generated  using  two  symmetry  planes  resulting  in  a  total  of 
62172(15798,15543,15543,15288)  unknowns.  The  spatial  resolution  of  this  model  was 
309  facets/X^.  The  BOR  representation  of  this  scatterer  consisted  of  54.6  segments/X 
which  resulted  in  1124  unknowns  per  Fourier  mode.  Thirty  Fourier  modes  were  used 
along  with  a  61  point  Gaussian  quadrature.  The  results  of  these  two  calculations  are 
shown  in  Figure  1.  and  it  is  pointed  out  that  the  combined  field  formulation  was  used  for 
both  calculations. 


Figure  1.  BOR  and  Triangular  Patch  monostatic  RCS  for  STRYPI  at  IGHz,  Theta 
polarization. 


The  results  clearly  show  excellent  comparison  between  the  two  methods  with  a  slight 
discrepancy  at  nose  on  (or  tip  aspect).  The  solution  times  for  the  two  methods  are 
presented  in  Table  1. 
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Formulation 

#  of  Nodes 

Matrix  Fill 

Matrix  Solve 

Total  Time 

BOR 

512 

.1 

Triangular  Patch 

768  n 

1.0 

.26 

2.16 

Table  1.  Solution  Times  for  BOR  and  Patch  formulations(  all  times  are  in  hours). 

The  second  problem  addresses  the  triangular  versus  the  quad-patch  formulation  of  the 
scatterer.  The  test  scatterer  for  this  case  is  the  1  meter  almond  at  2  GHz.  The  geometry 
and  unknown  information  for  this  model  is  shown  in  Table  2. 


Formulation 

Nodes 

Facets 

Total  Number  of 
Unknowns 

Quad 

■SI 

14976 

Wk\SiiM 

Table  2.  Geometry  and  unknown  information  for  quad  and  triangular  patch  formulation. 

Note  that  the  geometry  information  is  actually  for  the  quarter  model  of  the  almond  so  two 
planes  of  symmetry  are  used  and  the  resulting  total  number  of  unknowns  is  approximately 
four  times  the  quarter  model  unknowns.  The  monostatic  res  is  shown  in  Figure  2  for 
theta  polarization  calculated  using  the  EFIE. 


Figure  2.  RCS  of  the  1  meter  almond  at  2  GHz  for  the  two  formulations  using  the  EFIE. 
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The  above  results  clearly  show  excellent  agreement  between  the  two  formulations.  The 
solution  times  for  these  runs  are  collected  in  Table  3.  Note  that  the  problems  were  run  on 
64  nodes  of  the  Intel  Paragon. 


Formulation 

Matrix  Fill 

Matrix  Solve 

Quad 

.16 

.16 

Triangle 

.14 

.29 

Table  3.  Solution  times  for  the  quad  and  triangular  patch  models(time  in  hr.). 


Conclusions 

The  previous  parallelization  effort  on  CAR1X>S-3D  resulted  in  a  code  that  permitted  new 
features  to  be  added  independent  of  the  parallel  implementation[7].  Two  new  features 
have  been  added  namely  the  BOR  formulation  and  the  quad-patch  version  of  the  code. 
These  features  were  easily  added  due  to  the  original  structure  of  the  code.  These  features 
have  been  validated  for  a  number  of  large  test  cases  a  few  of  which  have  been  presented 
here. 
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Abstract 

In  contrast  to  the  TLM  scheme  for  the  symmetrical  condensed  node  (SCN),  the  field 
variables  of  the  alternating  transmission  line  matrix  scheme  (ATLM)  are  sampled  at 
the  center  of  the  three-dimensional  TLM  cell.  Considering  the  spatial  derivatives  of  the 
electric  and  magnetic  field  components  yields  a  bijective  one-to-one  mapping  between 
the  six  field  components,  six  spatial  derivatives  of  the  field  components  and  the  twelve 
wave  amplitudes.  ATLM  does  not  support  any  nonphysical  or  spurious  solutions  and 
it  represents  the  only  TLM  scheme  with  the  same  number  of  linearly  independent  field 
variables  per  cell  as  Yee’s  FDTD  scheme. 


I  Introduction 

The  close  relationship  between  the  TLM  and  FDTD  method  has  been  shown  recently  by 
deriving  both  methods  directly  from  Maxwell’s  equations  [1,  2,  3].  Applying  the  method  of 
moments  to  Maxwell’s  equations  yields  a  finite  difference  scheme  depending  on  the  choice 
of  the  subdomain  basis  functions.  While  for  FDTD,  the  field  variables  are  the  electric  and 
magnetic  field  components,  TLM  uses  incident  and  scattered  wave  amplitudes  instead.  The 
wave  amplitudes  represent  a  linear  combination  of  the  field  components  allowing  to  rewrite 
the  finite  difference  scheme  in  a  form  in  which  the  scattered  wave  amplitudes  at  the  time 
{k  +  l)At  are  only  dependent  on  the  incident  wave  amplitudes  at  kAt.  This  relationship 
between  the  incident  and  reflected  wave  amplitudes  defines  a  scattering  matrix  which  describes 
the  properties  of  a  multiport  and  thus  determines  an  equivalent  network  for  the  TLM  cell. 

The  network  model  of  TLM  has  become  useful  in  many  applications,  however  it  also 
results  in  nonphysical  solutions  which  do  not  satisfy  Maxwell’s  equations  [1].  The  presence  of 
nonphysical  or  spurious  solutions  limits  the  applicability  of  the  TLM  method  and  indicates  the 
use  of  unnecessary  field  variables.  In  fact,  it  has  been  shown  that  for  the  conventional  three- 
dimensional  TLM  schemes,  half  of  the  field  variables  do  not  contribute  to  the  calculation 
of  physical  solutions  indicating  the  unnecessary  use  of  half  of  the  field  variables  [3].  In  this 
paper,  the  foundations  of  the  novel  alterning  transmission  line  matrix  scheme  (ATLM)  [4]  are 
presented.  ATLM  does  not  support  any  spurious  solutions  and  therefore,  it  does  not  exhibit 
a  redundancy  factor  of  two  as  conventional  three-dimensional  TLM. 

Since  ATLM  is  based  on  the  TLM  scheme  for  the  symmetrical  condensed  node  (SCN)  [5], 
its  field  theoretical  derivation  is  similar  to  the  field  theoretical  derivation  of  the  SCN  [2]. 
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However,  the  sampling  of  the  field  components  is  different  for  the  two  TLM  schemes:  While 
for  TLM  with  SON,  every  three-dimensional  cell  with  the  spatial  coordinates  L,  m,  n  is  sampled 
at  every  time  sampling  point  kAt  in  the  cell  boundaries  [2],  for  ATLM,  either  the  odd  cells 
with  odd  ^  +  m  +  n  +  A:  or  the  even  cells  with  even  l  +  m  +  n-j-  k  are  sampled  in  the  center  of 
the  three-dimensional  cell  [4].  This  sampling  leads  to  a  bijective  one-to-one  mapping  between 
the  six  field  components,  six  spatial  derivatives  of  the  field  components  and  the  twelve  wave 
amplitudes.  Each  one  of  the  six  spatial  derivatives  represents  the  sum  of  the  derivatives  in  the 
two  directions  perpendicular  to  a  field  component.  This  sum  of  the  derivatives  is  called  the 
tangential  derivatives  of  a  field  component. 


II  Derivation  of  ATLM 

We  proceed  in  the  same  way  as  demonstrated  for  the  SCN  [2]  and  expand  the  electric  and 
magnetic  field  components  in  the  same  series  of  basis  functions.  Inserting  the  field  expansions 
in  Maxwell’s  equations  and  sampling  the  equations  in  the  odd  and  even  cells  yields  two  cou¬ 
pled  finite  differences  schemes  for  the  electric  and  magnetic  field  components.  Applying  the 
cell  boundary  mapping  [2],  we  obtain  two  finite  difference  schemes  for  the  wave  amplitudes 
describing  the  scattering  of  the  wave  amplitudes  at  the  even  and  at  the  odd  nodes. 

Using  the  state-space  representation  of  the  electromagnetic  field  [2,  6],  we  have 

|6,)=rS|a,)  (1) 

with  v  =  e,o.  For  the  index  e,  eq.  (1)  describes  the  scattering  at  the  nodes  for  which  the  sum  of 
the  spatial  coordinates  I,  m,  n  of  the  cell  is  even,  for  the  index  o,  eq.  (1)  describes  the  scattering 
at  the  nodes  for  which  the  sum  of  the  spatial  coordinates  I,  m,  n  of  the  cell  is  odd.  The  vectors 
|ut,)  summarize  all  incident  wave  amplitudes,  the  vectors  |6u)  all  scattered  wave  amplitudes  of 
the  TLM  mesh  at  all  time  sampling  points  kAt.  S  represents  the  scattering  matrix  of  the  SCN 
and  T  the  time  shift  operator  indicating  that  the  scattering  at  a  node  is  connected  with  the 
time  delay  At.  For  each  cell  boundary  surface,  the  wave  amplitudes  incident  into  one  TLM 
cell  are  identical  with  the  wave  amplitudes  scattered  from  the  neighboring  TLM  cells.  This 
relation  is  expressed  by  the  connection  operator  F  [2,  3]  via 

|n.)  =  r  \K)  .  (2) 

The  connection  operator  T  is  a  matrix  of  space  shift  operators  [6]. 

As  explained  in  [4],  for  homogeneous  regions,  the  scattering  and  propagation  of  the  wave 
amplitudes  at  the  even  cells  and  at  the  cells  for  which  the  sum  of  the  spatial  coordinates  I,  m, 
n  and  the  time  index  k  is  even,  respectively,  is  independent  of  the  scattering  and  propagation 
of  the  wave  amplitudes  at  the  odd  cells.  We  combine  both  independent  processes  by  defining 
the  vectors  of  the  incident  and  scattered  wave  amplitudes  at  the  even  and  odd  nodes, 


|a.)  =  1/2  (1  +  T)  |a„> 

(3) 

|6.)  =  1/2  (1  +  T)  W 

(4) 
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Eq.  (1)  yields 


l&v)  =  TS  ja,)  (5) 

for  the  scattering  at  the  even  and  odd  nodes.  For  the  propagation  of  the  wave  amplitudes  from 
the  even  nodes  to  the  odd  nodes  and  vice  versa,  we  obtain 

[a,)  =  r  \be)  (6) 

and 

|a.>  =  r  |6<.>  .  (7) 

Eliminating  the  wave  amplitudes  at  the  odd  nodes  as  well  as  the  scattered  wave  amplitudes 
at  the  even  nodes  yields 

la,)  =  T^(rsf  |a.)  .  (8) 

The  latter  equation  represents  ATLM:  the  twelve  linearly  independent  wave  amplitudes  are 
defined  only  at  the  even  nodes.  Thus  there  are  twelve  linearly  independent  field  variables  for 
two  TLM  cells  and  for  one  ATLM  cell,  respectively,  or  six  linearly  independent  field  variables 
per  TLM  cell.  This  is  the  same  ratio  as  for  FDTD  and  therefore  ATLM  does  not  exhibit  any 
redundancy  as  it  is  the  case  for  conventional  three-dimensional  TLM  [3]. 

Fig.  1(a)  illustrates  the  elemenatary  cell  for  ATLM.  While  for  conventional  three- 
dimensional  TLM,  the  elementary  cell  is  a  cube  with  a  side  length  of  Al  and  a  volume  of 
(Al)^,  the  ATLM  elementary  cell  is  a  rhomboedron  with  a  volume  of  2{Al)^  and  with  a 
volume  of  two  TLM  cells,  respectively. 


Ill  Mapping  between  the  field  components  and  the 
wave  amplitudes 

The  cell  boundary  mapping  introduced  for  the  TLM  scheme  for  the  SCN  [2]  is  not  applicable 
for  ATLM,  since  it  relates  the  electric  and  magnetic  field  components  to  the  wave  amplitudes 
of  both  odd  and  even  nodes.  In  order  to  obtain  a  mapping  which  relates  the  field  components  to 
the  wave  amplitudes  of  either  the  even  or  the  odd  cells,  the  field  components  have  to  be  sampled 
at  the  center  of  the  even  or  odd  cells.  In  this  section,  we  derive  a  bijective  one-to-one  mapping 
between  the  field  components  and  their  spatial  derivatives  at  the  center  of  the  even  cells  and 
the  wave  amplitudes  incident  into  the  even  nodes.  This  is  done  by  sampling  the  expansions  of 
the  electric  and  magnetic  field  components  in  the  same  way  as  sampling  Maxwelfis  equations 
for  the  derivation  of  ATLM  and  the  TLM  scheme  for  the  SCN,  respectively.  As  shown  in  [2], 
Maxwell’s  equations  have  to  be  sampled  with  delta  functions  as  well  as  with  the  sum  of  their 
spatial  derivatives  in  order  to  determine  the  scattering  matrix  of  the  SCN  uniquely.  A  detailed 
discussion  of  sampling  Maxwell’s  equations  with  the  sum  of  the  spatial  derivatives  of  the  delta 
functions  can  be  found  in  [7], 

We  define  the  electric  and  magnetic  field  components  at  the  center  of  the  even  ATLM  cells 

by 

M/2Fri.„  =  jjfj  F,(x,t)S(t-(k  +  \l2)lit) 
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X  6{x  -  lAl)  6{y  -  mAl)  8{z  -  nAl)  dx  dy  dz  dt 


k+l/2F^,kn  = 


Fy{x,t)  6{t-ik  + 1/2) At) 

X  6{x  ~  lAl)  6{y  -  mAl)  6{z  -  nAl)  dx  dy  dz  dt 


-  JjJ]  FMt)5{t-{k +  1/2) At) 

X  6{x  -  lAl)  6{y  -  mAl)  6{z  -  nAl)  dx  dy  dz  dt  (9) 

with  F  =  E,H.  Each  of  the  four  integrations  extends  from  -oo  to  +oo  and  I +  m  +  n  +  k  is 
an  even  number.  Inserting  the  field  expansions  given  in  [2]  yields 

|F,)  =  Pi  |a.)  (10) 

with  the  operator 
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The  vector 


-  E 

k,l,m,n=-c 


\k]  l,m,n) 


ZkHf^n 

.  Z  kH/Xn  J 

is  a  vector  in  the  field  state  space  Tip  [2,  3],  It  summarizes  the  field  components  at  the 
even  ATLM  nodes  of  the  complete  mesh  at  all  time  sampling  points  kAt  The  impedance  Z 
represents  the  wave  impedance  of  the  free  space.  Eq.  (10)  describes  the  mapping  of  the  wave 
amplitudes  incident  into  the  even  nodes  onto  the  six  electric  and  magnetic  field  components 
at  the  center  of  the  even  ATLM  cells.  This  mapping  is  identical  to  the  traditional  mapping 
between  the  incident  wave  amplitudes  and  the  field  components  [5],  however,  as  discussed 
in  [2],  it  is  not  bijective. 

In  order  to  obtain  a  bijective  one-to-one  mapping,  the  spatial  derivatives  of  the  field 
components  at  the  center  of  the  ATLM  cells  have  to  be  considered.  We  sample  the  expansions 
of  the  electric  and  magnetic  field  components  with  the  sum  of  the  spatial  derivatives  of  the 
delta  functions  in  the  directions  perpendicular  to  the  field  components.  Thus  we  define  the 
tangential  derivatives  of  the  electric  and  magnetic  field  components  at  the  center  of  the  even 
ATLM  cells  by 

_  Al  rrrr  /  d  .  5  \  c/. 


^  +  6(t-{k  + 1/2) At) 
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X  6{x  -  lAl)  6{y  —  mAl)  6{z  —  nAl)  dx  dy  dz  dt 

=  f  Jill  (I  +  I)  F,(f.  t)  Sit  -  ik  +  1/2)  At) 

X  6{x  —  lAl)  6{y  —  mAl)  S{z  —  nAl)  dx  dy  dz  dt 


with  the  inverse  mapping 

(18) 

where 

Qi  =  Pf  and  Q^^Pl 

(19) 
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IV  Dispersion  analysis  for  ATLM 


As  described  in  [3],  eq.  (8)  requires 


det 


(rsf 


=  0 


(20) 


in  both  frequency  and  wave  vector  domains  for  any  non-trivial  solution  of  TLM.  The  nor¬ 
malized  frequency  Q  is  given  by  =  27:Atf  and  the  connection  operator  in  wave  vector 
domain,  T,  represents  a  twelve-dimensional  matrix  with  the  elements  and  where 

X=  Alkx.,r}  =  Alky  and  ^  =  Alk^  are  the  normalized  wave  vector  components  of  the  wave 
vector  k  =  [kx,  ky,  Prom  eq.  (20),  we  obtain  twelve  eigenvalues  given  by 


Ai,2  =  Cl  +  {Ci)^  —  1  A3_4  —  Cl  —  -^(Ciy^  1  Ase  —  1  (21) 


with 

Cl  =  i  (  cos(x)  cos(rj)  +  cos(x)  cos(0  +  cos(7?)  cos(0  -  l)  •  (22) 

The  eigenvalues  A7, . . . ,  A12  are  identical  with  Ai, . . . ,  As-  All  the  eigenvectors  of  these  twelve 
eigenvalues  describe  physical  solutions  and  solutions  converging  to  solutions  of  Alaxwell's  equa¬ 
tions  for  frequencies  approaching  zero,  respectively.  Thus  there  are  no  more  nonphysical  so¬ 
lutions  that  means  no  artifacts  introduced  by  the  discretization  of  Maxwell’s  equations  as  in 
conventional  three-dimensional  TLM  [3].  Note  that  if  we  calculate  the  eigenvalues  of  FDTD, 
we  obtain  six  physical  eigenvalues  [3].  Four  eigenvalues  describe  the  propagating  solutions  and 
two  eigenvalues  describe  the  electro-  and  magnetostatic  case.  If  we  consider  the  eigenvalues  for 
FDTD  for  two  subsequent  time  steps,  we  obtain  twelve  eigenvalues,  where  eight  eigenvalues 
describe  the  propagating  solutions  and  four  eigenvalues  the  electro-  and  magnetostatic  case. 
This  corresponds  to  the  situation  in  ATLM,  where  we  also  have  eight  eigenvalues  describing 
the  propagating  solutions  and  four  eigenvalues  describing  the  electro-  and  magnetostatic  case. 
This  confirms  the  results  of  the  second  section:  Both  ATLM  and  FDTD  exhibit  six  linearly 
independent  field  variables  per  cell. 

The  eigenvectors  corresponding  to  the  eigenvalues  Ai, . . . ,  A4  describe  solutions  propagating 
in  a  mesh  of  ATLM  nodes.  For  these  solutions,  we  obtain  the  dispersion  relation 

cos^(Q)  =  i  ( cos(x)  cos{rj)  +  cos(x)  cos(^)  -h  cos(r?)  cos(^)  ■+  l)  (23) 

which  is  the  same  as  for  the  SON  [3].  Therefore,  the  dispersion  relation  exhibits  the 
same  ambiguity  for  solutions  approaching  zero  for  modes  with  k  «  [0,0,0]^  and  with 
k  ^  [7r/A/,7r/A^,7r/A^]'^.  For  the  SON,  this  ambiguity  leads  to  the  appearance  of  spurious 
modes  [8].  For  ATLM  however,  spurious  modes  cannot  be  excited,  since  the  range  of  the  wave 
numbers  for  ATLM  is  smaller  than  the  range  of  the  wave  numbers  for  conventional  TLM.  This 
is  obvious  from  the  first  Brillouin  zone  corresponding  to  the  ATLM  elementary  cell  depicted  in 
Fig.  1(b).  While  the  first  Brillouin  zone  corresponding  to  the  TLM  elementary  cell  is  again  a 
cube  with  a  side  length  of  2Tr/Al  and  a  volume  of  {2n/Alf,  the  first  Brillouin  zone  correspond¬ 
ing  to  the  ATLM  elementary  cell  is  a  polyhedron  with  a  side  length  of  27r/A/  with  only  half  of 
the  volume  of  the  first  TLM  Brillouin  zone.  Thus  for  wave  propagation  in  (1, 1, 1)  direction, 


243 


we  have  k  <  [7r/A/,7r/A/,7r/A/]^  for  TLM,  but  only  k  <  [7r/(2Al),7r/(2A/),7r/(2A^)]^  for 
ATLM.  The  latter  condition  excludes  the  appearance  of  spurious  modes  in  ATLM.  Fig  2  illus¬ 
trates  the  effect  of  the  smaller  volume  of  the  ATLM  Brillouin  zone  on  the  dispersion  diagrams 
for  wave  propagation  in  (1, 1, 1)  direction  with  x  =  V  ~ 

V  Conclusions 

While  the  field  theoretical  derivation  of  ATLM  is  very  similar  to  the  field  theoretical  derivation 
of  TLM  with  SON,  the  mapping  relating  the  field  components  to  the  wave  amplitudes  is 
different.  Sampling  the  expansions  of  the  electric  and  magnetic  field  components  in  the  center 
of  the  ATLM  cells  yields  a  bijective  one-to-one  mapping  between  the  six  field  components, 
six  spatial  derivatives  of  the  field  components  and  the  twelve  wave  amplitudes.  One  part  of 
this  mapping,  the  mapping  of  the  twelve  wave  amplitudes  onto  the  six  electric  and  magnetic 
field  components  is  identical  to  the  traditional  mapping  proposed  for  TLM  with  SON  [5] .  The 
other,  new  part  of  the  mapping  is  necessary  in  order  to  obtain  a  bijective  mapping  for  the 
twelve  field  variables. 

As  shown  by  a  dispersion  analysis,  ATLM  does  not  support  any  nonphysical  or  spurious 
solutions  and  therefore,  it  does  not  exhibit  a  redundancy  factor  of  two  as  conventional  three- 
dimensional  TLM.  From  a  field  theoretical  point  of  view,  ATLM  represents  the  only  TLM 
scheme  which  does  not  exhibit  any  disadvantages  in  comparison  to  Yee’s  FDTD  scheme. 
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Abstract 

A  novel  node  for  TLM  symmetrical  condensed  schemes  is  presented,  which  unlike  existing  nodes, 
contains  link  lines  with  the  characteristic  impedance  equal  to  the  intrinsic  impedance  of  modelled  medium. 
Open  and  short-circuit  stubs  are  loaded  into  this  node  in  equal  proportions,  thus  ensuring  dispersion 
solutions  independent  of  the  wave  polarization.  Another  advantage  of  such  a  node  is  that  the  scattering 
at  material  interfaces  is  described  more  accurately  due  to  the  matching  impedance  condition.  The 
development  and  characterization  of  this  node  gives  more  insight  into  the  influence  of  stub-loading  on 
the  dispersion  behaviour  of  a  TLM  condensed  scheme  and  provides  a  theoretical  basis  for  the  combination 
of  stub-loading  and  alteration  of  link  line  impedances  to  develop  in  the  future  a  more  accurate,  adaptable 
node. 


Introduction 

In  recent  years,  the  conventional  TLM  based  on  the  stub-loaded  symmetrical  condensed  node  (SON)  [1] 
has  been  enhanced  by  a  wider  choice  of  nodes  offering  computational  savings  and/or  improved  accuracy. 
Notable  among  these  nodes  are  hybrid  symmetrical  condensed  nodes  (HSCN)  [2,  3]  and  the  symmetrical 
super-condensed  node  (SSCN)  [4].  In  the  stub-loaded  SCN  all  deviations  from  material  properties  and  a 
regular  cubic  mesh  are  accounted  for  by  stubs,  with  link  lines  representing  free  space  properties.  In  the 
case  of  the  HSCN  the  same  deviations  are  partly  modelled  by  stubs  and  partly  by  link  lines.  Finally,  in 
the  SSCN  all  deviations  are  modelled  by  the  link  lines  as  there  are  no  stubs  present. 

Our  main  objective  in  the  derivation  of  the  SSCN  [4]  was  to  reduce  storage  requirements  by  eliminat¬ 
ing  the  need  for  stubs.  An  additional  benefit  of  this,  as  explained  in  [5],  was  a  higher  efficiency  in  terms 
of  CPU  run-time,  compared  to  the  stub-loaded  and  hybrid  nodes.  Also,  the  time  step  allowed  in  a  graded 
mesh  was  found  to  be  substantially  higher  in  the  SSCN  than  in  the  previous  nodes  [4].  The  dispersion 
analysis  of  TLM  schemes  with  different  condensed  node  schemes  [6,  7,  8]  showed  that  the  alterations 
made  to  the  basic  12-port  SCN,  in  terms  of  adding  stubs  and  varying  the  characteristic  impedance  of 
link  lines,  caused  different  effects  in  the  dispersion  characteristics,  depending  on  how  these  alterations 
were  done.  The  most  notable  effect  in  the  stubless  SSCN  was  that  error  due  to  dispersion  increases  with 
an  increase  in  the  material  properties  (or  equivalently,  by  decrease  in  the  time  step)  [6,  7,  9].  Otherwise, 
dispersion  behaviour  of  the  SSCN  is  similar  to  that  of  the  basic  12-port  SCN:  the  solution  is  independent 
on  the  field  polarization,  the  solution  is  identical  for  eriir  —  const  and  all  errors  in  the  magnitude  of 
propagation  vector  extend  in  the  same  (positive)  direction. 
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The  most  notable  effect  of  the  stub-loading  in  the  SCN  and  the  HSCN  was  the  existence  of  two 
dispersion  solutions  corresponding  to  the  two  orthogonal  wave  polarizations.  This  ambiguity,  called  also 
mesh  anisotropy  [8],  arises  from  the  fact  that  open  and  short-circuit  stubs  are  added  to  the  node  in 
unequal  proportions,  and  because  of  this,  the  overall  coupling  of  stub  and  link  lines  with  the  electric  and 
magnetic  field  is  asymmetrical.  The  mesh  anisotropy  has  an  effect  that  two  different  orthogonal  wave 
polarizations  of  the  electric  field  propagate  in  a  TLM  mesh  based  on  the  HSCN  and  the  stub-loaded  SCN 
with  different  dispersion  characteristics.  The  only  exception  from  this  behaviour  is  the  TLM  mesh  with 
stubbed  SCN  modelling  materials  with  Sr  =  Pr-  Prom  the  dispersion  analysis  of  the  stub-loaded  SCN  [7] 
it  also  follows  that  the  dispersion  characteristics  are  dependent  on  the  wave  impedance,  i.e.  on  the  ratio 
Pr/^T  for  the  £rPr  =  cojist  which  is  in  contrast  to  physical  conditions.  For  all  these  reasons,  it  is  difficult 
to  interpret  error  trends  due  to  stub- loading. 

In  order  to  eliminate  these  undesirable  features  of  the  stub-loaded  SCN,  we  develop  here  a  new 
stub-loaded  node,  maintaining  a  constant  link  line  impedance,  but  equal  to  the  intrinsic  impedance  of 
modelled  medium  rather  than  to  the  impedance  of  the  background  medium.  This  node  will  be  referred  to 
as  the  matched  SCN  (MSCN).  Another  benefit  of  constructing  such  a  node  is  that  reflections  at  material 
interfaces  are  modelled  more  accurately.  In  the  previous  nodes,  the  impedance  of  the  link  lines  does  not 
in  general  coincide  with  the  impedance  of  the  medium  being  modelled.  The  consequence  of  this  is  that 
waves  pass  through  the  interface  without  experiencing  immediately  the  correct  amount  of  reflection  and 
transmission.  Intuitively,  it  can  be  expected  that  the  new  MSCN  models  more  accurately  the  physical 
conditions  at  the  material  interfaces  in  inhomogeneous  problems. 

The  complete  derivation  of  the  matched  SCN  from  the  unified  formulation  of  TLM  parameters  [10] 
will  be  described  in  the  next  section,  followed  by  implementation  issues  and  the  dispersion  analysis.  A 
comparison  of  errors  in  the  MSCN  with  those  found  in  other  nodes,  particularly  in  the  SSCN,  will  be 
made  and  an  emphasis  will  be  placed  on  assessing  errors  due  to  reflections  at  interface  between  different 
materials. 


Derivation  of  the  MSCN 


Prom  the  dispersion  analysis  of  the  stub-loaded  SCN  [7]  it  can  be  seen  that  the  two  physical  solutions 
of  the  dispersion  relation  become  identical  only  in  the  case  when  £r  —  Pr  (Case  2  in  [7]).  The  intrinsic 
impedance  of  the  medium  Z  equals  in  this  case  the  characteristic  impedance  of  the  link  lines  Zq,  since  it 
follows  that: 


This  observation  suggests  that  one  may  expect  that  the  dispersion  characteristics  of  a  node  having  link 
lines  whose  characteristic  impedance  is  equal  to  the  intrinsic  impedance  of  the  modelled  medium  will  be 
similar  to  those  of  the  stub-loaded  SCN  when  Sr  =  pr-  The  parameters  of  such  a  node  can  be  derived 
from  the  general  TLM  constitutive  relations,  formulated  as  [10]: 


Cik^i  +  CjkM +  Co  =  £k 

u  AzA? 

LijLi  -I-  Lji  A;  -h  =  pk  . 


(1) 

(2) 


where  i,j,k  €  {x,y,z}  and  i  ^  j,k.  In  the  above  equations,  distributed  characteristic  capacitance 
and  inductance  of  an  z-directed,  j-polarized  link  line  is  denoted  by  Cij  and  Lij,  respectively,  whilst 
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total  capacitance  and  inductance  of  open  and  short  circuit  stubs,  respectively,  modelling  fields  in  the 
A:-direction  are  denoted  by  and  X*. 

In  time  domain  schemes,  synchronism  must  be  maintained,  which  is  described  by  six  equation  in  the 
form: 

At  =  Ai^CijLij  (3) 

where  i.j  €  {x,y,  z}  and  i  ^  j.  The  system  of  twelve  equations  determined  by  the  generic  formulae  (1)- 
(3)  is  referred  to  as  the  general  system  of  equations  describing  the  parameters  of  a  TLM  time-domain 
scheme. 

The  characteristic  impedance  and  admittance  of  an  i-directed  j-polarized  line  are  defined  as  Zij  = 
l/Yij  —  yj Lij I Cij.  Taking  into  account  the  time  synchronism  condition  (3),  the  characteristic  impedance 
and  admittance  of  link  lines  can  be  written  as  function  of  the  time  step: 


In  stubs,  pulses  have  to  make  a  round  trip  during  the  time  step  At,  thus  requiring  the  effective 
transit  time  along  the  length  of  a  stub  to  be  equal  to  At/2.  Therefore,  the  characteristic  impedance 
of  short-circuit  stubs  and  the  characteristic  admittance  of  open-circuit  stubs  are  defined,  respectively 
as  [11]: 

2if  _2C.‘ 

Z,k  -  lit  -  (5) 

where  k  G  {x,y,  z}. 

Dividing  the  system  of  equations  {l)-(2)  by  At  and  using  formulae  (4)-(5),  we  obtain: 


Zij  +  Zji  + 


AiAj 

2 

^^AkAt 

Zsk 

AiAj 

2 

^’^AkAt 

In  the  MSCN,  all  characteristic  impedance  of  link  lines  are  equal  to  the  intrinsic  impedance  of  the 
medium  and  the  system  of  equations  (6)-(7)  with  the  condition  Zij  =  Z  (or  equivalently  Yij  =  Y  =  1/Z) 
reduces  in  the  case  of  an  isotropic  medium  to: 


The  characteristic  admittance  of  open-circuit  stubs  and  the  impedance  of  short-circuit  stubs  in  the 
A;-direction,  Yok  and  Zgk  respectively,  are  obtained  from  (8)  and  (9)  as: 


Zsk  —  4Z 


AiAj 

2cAtAk'' 
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(12) 


Normalizing  the  values  of  Yok  and  Z^k  to  those  of  the  link  lines  {Y,  Z)  we  obtain: 

The  scattering  matrix  for  the  MSCN  can  be  easily  derived  from  the  original  stub-loaded  SCN  scat¬ 
tering  matrix  [1]  by  taking  into  account  the  definitions  of  stub  parameters  given  by  (12).  For  the  case  of 
a  uniform  mesh,  with  Ai  =  Aj  =  Ak  ~  Al  and  At  =  AZ/(2c),  the  stub  parameters  simplify  to: 

Yo  =  Zs=4{,/i^~l)  (13) 

Since  the  MSCN  is  identical  to  the  stub-loaded  SCN  when  €r  =  they  will  have  identical  dispersion 
characteristics  in  this  case.  And  since  the  parameters  of  the  MSCN  are  independent  of  the  ratio  pltIct 
for  =  const  the  dispersion  characteristics  of  the  MSCN,  in  contrast  to  those  of  the  stub-loaded 
SCN,  will  be  independent  of  the  ratio  for  SrHr  =  const.  Furthermore,  the  dispersion  relation  of 

the  MSCN  will  have  the  identical  functional  form  as  the  dispersion  relation  of  the  stub-loaded  SCN  in 
Case  2,  derived  in  [7],  yielding  the  same  physical  solution  for  the  two  orthogonal  wave  polarizations.  The 
dispersion  characteristics  of  the  MSCN  are  therefore  advantageous  compared  to  the  stub-loaded  SCN 
because  of:  a)  the  independence  on  the  wave  impedance  (i.e.  the  factor  fir/^r)  for  =  const  and 
b)  the  existence  of  a  unique  propagating  solution  for  any  ratio  Pr/^r-  A  disadvantage  of  the  MSCN 
dispersion  characteristics  is  a  high  error  range.  Namely,  since  the  error  range  in  the  stub-loaded  SCN 
is  the  highest  in  Case  2  [7]  and  following  the  equivalence  between  the  dispersion  relations  of  the  MSCN 
and  the  Case  2  stub-loaded  SCN,  it  follows  that  the  total  error  range  in  the  MSCN  will  never  be  smaller 
than  for  the  stub-loaded  SCN,  especially  when  a  dielectric  (//^  =  1)  or  magnetic  {sr  =  1)  material  is 
modelled  (Case  1  SCN  [7]). 

However,  when  modelling  inhomogeneous  media,  the  errors  due  to  dispersion  are  found  to  be  smaller 
in  the  MSCN  than  predicted  from  the  dispersion  analysis  for  homogeneous  problems,  as  it  will  be  shown 
later.  This  behaviour  can  be  attributed  to  the  fact  that  the  impedance  of  link  lines  always  matches  the 
impedance  of  the  modelled  medium  in  the  MSCN.  Accordingly,  the  scattering  of  voltage  pulses  occurring 
at  the  interface  between  two  materials  is  directly  equivalent  to  the  scattering  of  waves  on  the  boundaries 
between  different  media.  In  the  conventional  stub-loaded  SCN,  the  voltage  pulses  pass  the  interface 
without  immediately  ‘seeing’  the  boundary,  while  in  the  HSCN  and  the  SSCN,  a  reflection/transmission 
process  occurs  due  to  different  values  of  the  link  line  characteristic  impedances  which  do  not  generally 
match  the  intrinsic  impedances  of  the  modelled  media. 

Prom  the  computational  point  of  view,  the  MSCN  is  clearly  as  demanding  as  the  stub-loaded  SCN.  In 
cases  when  only  dielectric  or  only  magnetic  materials  are  modelled,  where  the  stub-loaded  SCN  normally 
uses  only  one  type  of  stub,  the  MSCN  is  more  demanding  since  it  edways  uses  both  types  of  stubs.  It 
appears  that  a  symmetrical  use  of  both  types  of  stubs  in  the  MSCN,  expressed  through  equation  (12), 
is  a  price  which  must  be  paid  in  order  to  retain  identical  dispersion  characteristics  for  the  two  wave 
polarizations.  Due  to  the  impedance  matching  condition,  the  connection  between  neighbouring  nodes 
on  different  sides  of  a  material  interface  must  be  modelled  as  for  the  HSCN  and  the  SSCN,  as  explained 
in  [5].  The  extra  amount  of  calculations  needed  in  this  case  can  be  justified,  however,  since  the  reflections 
on  the  interface  are  in  accordance  with  problem  requirements.  Finally,  the  matching  impedance  condition 
simplifies  the  implementation  of  external  boundaries  in  TLM  based  on  the  MSCN,  since  the  line  reflection 
coefficients  will  always  be  equal  to  the  medium  reflection  coefficients  [12]. 
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Comparison  of  error  trends  in  the  MSCN  and  the  SSCN 

Having  eliminated  the  presence  of  the  two  orthogonal  solutions  of  the  dispersion  relation  and  their 
dependence  on  the  wave  impedance  in  the  conventional  stub-loaded  SCN,  by  developing  a  new  stub- 
loaded  node  with  the  matching  impedance  condition  (MSCN),  it  is  now  possible  to  investigate  more 
closely  the  effects  of  stub-loading  and  link  line  impedance  changing  on  the  numerical  accuracy  of  a 
general  condensed  node  TLM  scheme.  It  appears  most  useful  to  compare  the  dispersion  characteristics 
of  the  SSCN  and  the  MSCN,  since  the  former  uses  exclusively  the  altering  of  link  lines  to  model  local 
changes  in  the  material  properties,  while  the  later  always  uses  stubs. 

Figure  1  shows,  in  the  same  manner  as  in  [7],  the  percentage  propagation  errors  in  the  SSCN  and 
the  MSCN  for  propagation  in  a  coordinate  and  a  diagonal  plane,  for  the  discretization  of  Al/X  —  0.1. 
The  angle  cp  represent  the  angle  between  the  propagation  vector  an  the  ^r-axis.  It  can  be  clearly  seen 
from  the  plots  in  Figure  1  that,  with  an  increase  in  SrPr  the  propagation  error  is  shifted  in  the  positive 
direction  for  the  SSCN  but  in  the  negative  direction  for  the  MSCN.  The  error  trends  observed  from 
Figure  1  suggest  that  if  a  part  of  the  excess  material  properties  is  modelled  through  stubs,  as  in  the 
MSCN,  and  a  part  through  varying  the  characteristic  impedances  of  link  lines,  as  in  the  SSCN,  one  may 
expect  that  the  propagation  error  in  such  a  combined  node  might  average  out,  i.e.  the  error  may  be 
small  and  fluctuate  around  zero.  This  possibility  is  currently  being  explored  by  the  authors  in  order  to 
formulate  more  accurate  TLM  condensed  nodes. 

Influence  of  material  interfaces 


The  traditional  dispersion  analysis  for  the  TLM  nodes  applies  for  an  infinite  space  with  a  homogeneous 
medium  [13].  However,  in  real  problems,  nodes  with  stubs  and  varied  link  line  impedance  are  used  in 
order  to  model  local  increases  in  the  medium  parameters,  i.e.  for  modelling  inhomogeneous  problems.  A 
full  investigation  into  the  effect  of  interface  discontinuities  in  the  TLM  method  has  not  yet  been  described 
in  the  literature.  However,  these  effects  have  been  investigated  empirically  [14]  by  studying  frequency 
errors  in  inhomogeneous  resonators.  A  similar  approach  is  used  here  in  order  to  test  the  performance  of 
different  nodes  and  to  estimate  the  influence  of  spurious  reflections  on  propagation  behaviour.  The  two 
square  resonators  depicted  in  Figure  2,  partly  filled  with  a  medium  with  materials  parameters:  Case  1) 
Er  =  4,  /Ltr  =  1  and  Case  2)  Ct  =  Pr  =  ^  are  studied  in  the  simulations.  Note  that  the  stub-loaded  SCN 
and  the  HSCN  are  identical  in  Case  1,  while  the  stub-loaded  SCN  and  the  MSCN  are  identical  in  Case 
2.  The  resonant  frequencies  of  TE  modes  10,  20,  30,  11,  21  and  31  are  calculated  for  each  case  using 
TLM  simulations  with  different  nodes  on  a  mesh  with  node  spacing  Al  =  1cm.  The  relative  propagation 
error  normalized  to  the  benchmark  discretization  of  Al/X  =  0.1  is  calculated  as  [14]: 


ifo-f) 

f  0.1  ^ 

^  (/o  -  /) 

f  0.1c  \ 

f  * 

Ui/aJ 

/  ' 

L  A1/q  ) 

where  /  is  the  modelled  frequency,  c  is  speed  of  light  and  /o  is  the  reference  frequency  obtained  by 
performing  calculations  on  a  very  fine  mesh.  Note  that  the  normalization  is  done  with  respect  to  the 
wavelength  in  the  material.  The  results  for  the  two  resonators  are  presented  in  Tables  1  and  2. 

The  spurious  reflections  can  be  studied  by  comparing  the  results  for  resonator  b  (with  two  interfaces 
medium-air)  with  the  results  for  resonator  a.  In  all  nodes  the  accuracy  in  modelling  the  two  resonators 
differs  greatly,  expect  in  the  MSCN  where  accuracy  is  generally  the  same  (most  noticeable  in  Case  2). 
This  indicates  that  in  problems  with  multiple  interfaces  the  MSCN  might  provide  greater  accuracy.  In  the 


Figure  2:  Schematic  of  inhomogeneous  resonators  (dimensions  are  in  mm) 


TE  mode  — >• 

10 

20 

30 
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21 

31 

Case  1:  £r  = 

4,  /i,  =  1 

fo  [GHz]  ^ 

0.9116 

2.0863 

2.9980 

1.2523 

2.3176 

3.2314 

SCN  &  HSCN 

-0.22 

-0.55 

-0.25 

+0.03 

-0.35 

-0.20 

MSCN 

6k 

-0.90 

-0.90 

-0.58 

-0.15 

-0.63 

-0.40 

SSCN 

[%] 

+1.48 

+0.75 

+0,64 

+1.77 

+1.11 

+0.62 

Case  2:  £r  ~ 

2,  fir  =  2 

fo  [GHz] 

0.9993 

1.9987 

2.9980 

1.3476 

2.2845 

3.1883 

SCN  &  MSCN 

-0.82 

-0.80 

-0.78 

-0.19 

-0.50 

-0.57 

HSCN 

6k 

-0.11 

-0.53 

-0.35 

+0.03 

-0.29 

-0.28 

SSCN 

[%1 

+1.46 

+0.57 

+0.86 

+1.73 

+0.92 

+0.80 

Table  1:  Normalized  percentage  propagation  error  in  resonator  (a) 


SSCN,  the  propagation  error  for  resonator  b  is  always  shifted  in  the  positive  direction  compared  to  that 
for  resonator  a.  It  is  also  interesting  to  note  that  for  different  propagation  modes  in  the  axial  direction 
(10,  20,  30)  all  nodes,  except  perhaps  the  MSCN,  exhibit  significant  variations  in  errors.  Clearly,  this 
requires  further  investigation  but  it  would  appear  from  these  preliminary  results  that  the  lower  order 
modes  are  influenced  more  by  the  spurious  reflections  at  interfaces. 


Conclusion 


A  new  nodal  structure  for  the  TLM  method  with  condensed  nodes  was  presented.  The  ambiguity  of 
two  dispersion  solutions  dependent  on  the  wave  polarization  found  in  the  traditional  stub-loaded  SCN 
was  eliminated  in  the  new  matched  SCN  (MSCN),  enabling  better  insight  into  the  error  trends  caused 
by  the  stub-loading.  It  was  shown  that  the  numerical  errors  caused  by  spurious  reflections  at  material 
interfaces  when  modelling  inhomogeneous  problems  were  minimized  in  the  MSCN  due  to  the  matched 
impedance  condition. 
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TE  mode  ->• 

10 

20 

30 

11 

21 

31 

Case  1 

:  £r  = 

4,  =  1 

/o  [GHz] 

0.8066 

1.8316 

2.9980 

1.1346 

2.0305 

3.1452 

SCN  &  HSCN 

-f-0.28 

-0.21 

-0.43 

+0.25 

-0.05 

-0.33 

MSCN 

5k 

-0.74 

-0.91 

-0.93 

+0.02 

-0.50 

-0.75 

SSCN 

[%] 

-h2.22 

+1.49 

+1.11 

+2.14 

+1.68 

+1.27 

Case  2:  Et  = 

2,  =  2 

/o  [GHz]  ^ 

0.9993 

1.9987 

2.9980 

1.2868 

2.2094 

3.1778 

SCN  &  MSCN 

-0.83 

-0.80 

-0.78 

-0.09 

-0.48 

-0.61 

HSCN 

5k 

+0.07 

-0.14 

-0.36 

+0.18 

-0.01 

-0.25 

SSCN 

[%] 

+2.00 

+1.42 

+0.97 

+2.04 

+1.62 

+1.14 

Table  2:  Normalized  percentage  propagation  error  in  resonator  (b) 
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INTRODUCTION 

In  order  to  make  the  TLM  algorithm  more  flexible  for  the  electromagnetic  analysis  of  general  structures,  it  is 
essentia!  that  a  variable  mesh  procedure  is  applied.  Indeed,  a  fine  mesh  is  required  in  regions  where  fields  are  rapidly 
varying  in  space  (e.g.,  conducting  edges,  comers,  narrow  strips).  This  allows  one  to  reduce  computer  requirement  as  a 
coarse  mesh  can  be  used  in  other  locations  where  fields  do  not  have  high  gradients.  However,  there  are  some  constraints 
about  how  the  TLM  grid  can  be  varied  [1],  In  order  to  reduce  the  number  of  constraints,  a  more  general  TLM  node  must 
be  constmcted.  Recently,  Hoefer  and  Sautier  [2]  presented  a  2D-TLM  rectangular  node  with  arbitrary  aspect  ratio  without 
stubs.  Homogeneous  sub  regions  can  use  different  aspect  ratio  but  not  arbitrarily.  In  this  paper.  Maxwell’s  equations  in 
integral  form  are  used  to  constract  a  general  2D-TLM  cell  with  variable  time  step  by  using  a  stub.  The  hybrid  nature  of  the 
node  allow's  an  arbitrary  aspect  ratio  for  all  regions  and  arbitrary  mesh  size  variation.  When  used  with  maximum  time-step, 
the  stub  can  be  removed  and  the  proposed  node  coincides  with  the  one  proposed  by  Hoefer  and  Sautier. 

The  formulation  starts  with  a  definition  of  incident  and  reflected  voltages  in  terms  of  fields.  Maxwell-Ampere  and 
Maxwell-Faraday  equations  are  then  used  to  establish  independent  relationships  between  incident  and  reflected  voltages. 
As  a  result,  a  scattering  matrix  is  obtained  which  also  accounts  for  electric  and  magnetic  losses.  In  addition,  the 
formulation  allows  the  presence  of  magnetic  and  electric  source  current  densities.  This  demonstrates  the  general  nature  of 
the  proposed  node  which  allows  rigorous  two-dimensional  field  modeling  for  both  TM  and  TE  cases  [3],  Finally, 
dispersion  characteristics  of  the  proposed  node  is  investigated.  The  equivalence  with  the  FDTD  cell  for  arbitrary  time-step 
is  demonstrated. 

APPROXIMATED  INTEGRAL  FORM  OF  MAXWELL’S  EQUATIONS 

Consider  an  anisotropic  medium  for  which  permittivity,  permeability  and  conductivity  tensor  are  diagonal. 
Maxwell-Faraday  and  Maxwell- Ampere's  equations  can  be  written  as 

^HodT=  ffs  — ods+ JJ&eEods-i- Jpeods  (1) 

<fEcdT  =  -  Jp^ods-  Jp^H  =  ds-  JP^  ods  (2) 

r  s  ^  s  s 

where  Jg  and  Jm  are  electric  and  magnetic  source  current  densities,  respectively  and,  e  =  EQdiag(exx,eyy,e2z) , 

p  =  Podiag(Pxx>Myy>Iizz),  -^‘^s(rrex’<^ey’^ez),  =  diag(amx»rTn,y,am2)  are  permittivity,  permeability,  electric 

conductivity,  magnetic  conductivity  tensors,  respectively.  In  order  to  approximate  (1)  and  (2),  consider  the  rectangular  cell 
with  dimensions  Ax  and  Ay  (a=Ay/Ax).  Without  loss  of  generality,  let  us  investigate  the  TM  case  (Transverse  Magnetic) 
for  which  electric  and  magnetic  fields  are  sampled  as  shown  in  figure  1.  Samples  are  taken  every  At  at  time  (k±l/2)At  and 
(k-i-l/2)At  if  no  time  reference  is  mentioned.  Taking  the  path  T  in  the  plan  ZOY  (figure  2),  an  approximation  of  (2)  can  be 
written.  Hence,  averaging  betw'een  time  (k-l/2)At  and  (k-i-l/2)At,  one  obtains 

^EodT  =  ^Al(E3"-Ef)  (3) 

r  ^ 
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for  which  the  field  was  supposed  constant  along  die  sides  of  the  cell,  where  T  is  the  retardation  operator  given  by 
Tu<*')  _  and  A1  an  arbitrary  spatial  step.  The  right-hand  side  of  (2)  can  also  be  approximated,  which  with  (3)  gives: 


A,  .(H^  +H3*)-Z„R„„A,  Hf  - v£ 


(4) 


where  Rm*  = 


1  gmx^-^y 


and  =  jiS  '  Ay  appendix  A).. 


Figure  I:  Generic  cell  for  field  sample  defimficm  (TM  case)  Figure  2:  Path  for  approximation  of  (2) 


At  this  stage,  one  has  the  elements  to  define  the  incident  voltage  vector  a  (at  time  [k-l/2]At)  and  the  reflected 
volt^e  vector  b  (at  time  Ek+I/2]At)  which  caa  be  written  as 


a  =  i[Al  Ef  +  ZyA^  Hr,  AI  E^ -Z^^Ay -H^,  AI  E3 -ZyA^  Hs,  Al-E^  +  Z^AyH^,  AIE5J 
b  =  |[Al  Ef-ZyA^.H^  Al-El  +  Z^Ay-Hl,  Al-Ef +  ZyAx  AI  E^-Z^Ay  H^,  AlEsJ 


(5) 


where  Z;  =  Zo/Vi ,  with  normalized  admittmce  %  is  defined  in  [4]: 

Y  cAt  A^  _  cAt  1  _  cAt  Ay  _  cAt  ^  (6) 

^  PxxAy’Al  PxxAl  tx  PyyAx'Al  PyyAl 

Finally,  using  (4)  and  (5)  yields: 

Jk+Oi)  ,(k+Oi)  (k-0,5)^^(k-05)  yi  .  „(k)  (k)  (7) 

bs  -b]  =-83  +ai  -ZoRmxAxHx  -Vmx  '■  ^ 

Using  tibe  same  procedure  in  the  ZOX-plane  yields: 

-b'r^  .  -a?-”'  -Zofe«yiyH?’  -V«  (8) 

Similarly,  approximating  (1)  on  the  XOY-plane  (see  figure  1)  gives  a  third  relation.  This  time,  five  samples  are  used  to 
approximate  E^  (appendix  A): 


255 


(9) 


E  _Yx(E^  +  E^)+VEi+E3)-^YsE5 

^  2Yx  +  2Yy+Ys 


where  Yj  is  the  stub  normalized  admittance  de&ied  by  E5  (see  figure  2)  and  which  is  calculated  by  setting: 

A.Av 


Finally, 


8„-^  =  Yx+Yy  +  Ys/2 

^  cAtAl  y  ® 


Yy(bi  +  +  Y^(b2  +  b4f  +  (Y3  + 

=  Yy(a,.a3f-«^.Y.(a2.a4f-^^U(Y3-G.)ar^^ 


(10) 


(H) 


F^re  4:  Evaluation  of  Ez*'^  in  the  YOZ-plane 

Before  determining  the  two  remaining  relatioa^y^,  one  has  to  establish  expressions  for  Hx'^^and  .  For  this  purpose, 
two  paths  Fi  and  r2  are  considered  in  the  XOY-plane  (figure  3).  Thus: 

^lldT  =  2aX‘'^  -^^^A^(Hr  +  H3") 
r  ^ 

Taking  into  account  that  Sj  and  S2  have  opposite  normal  vector,  the  only  non-zero  integral  contribution  through  S  (within 
die  assumed  approximation)  is  the  one  due  to  die  displacement  current.  Indeed,  others  are  approximated  by  values  at  the 
center  of  the  rectangle.  Hence, 


and 


A.Hf4|[(b3-b,r“'-(a3-a.r“>] 

(12) 

A,H«.i|[(b3-b,r«^>-(a,-a,f-“>] 

(13) 
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Now,  one  can  calculate  =(1  +  T)E5 /2  along  two  paths,  each  one  contained  in  mutually  perpendicular  planes.  For 
instance,  applying  the  procedure  pertaining  to  figure  4  allows  one  to  obtain: 

2(br-^^ar-^>)  =  (b,  .b3f  (14) 

Similar  reasoning  yields  the  last  relation: 

2(b5  +a5  )  =  (b2  +  b4)'  +(a2  +  a4)  (lo) 

Equations  (7),  (8),  (1 1),  (14)  and  (15)  can  be  written  in  matrix  form: 

[p,>b<>^*''«=[P2>a"‘-'’«-[P3]"»"‘>  (16) 

Introducing  [5]  and  [S^]  the  scattering  voltage  and  source  matrices,  respectively  (see  appendix  B): 


.  -  r-,{k)  -/k)  -/k)i‘ 

where  u  -  Vg^  ,  >  ^my 


-1-lR^Yy  0  l-f^R^Yy  0  0 

_1  0-1  0  2 

0  -10  -12 


l-^Vy  0  ® 

0  0  1-:^R^Y,  0 

10  10-2 

0  1  0  1-2 


d  [P3]=  01000  .In  the  above,  Ggz  =  Zq - - — ^  Rmk  =  Y~ 


1  c^mk^I-Aj 


NODE  MAIN  FEATURES 

For  the  sake  of  simplicity,  consider  first  Pxx  =  M-yy  =  Pr ,  and  e^z  =  s, .  In  this  case,  using  (6)  and  (10)  yields: 


1 

2£rAxAy  2cAt 

^a^+r 

A1 

cAt  Pr 

^  «  J 

In  order  to  eliminate  the  stub  ( Yj  =  0 ),  At  is  maximum  and,  according  to  (18),  is  given  by 

M  A  _ ! _ 

^Imax-  ^x  pr -  ^y  rp 

Va  +1  ^  Va  +1 


It  is  easy  to  see  that 


[22,  2 

-a  a  la 

1  1  -1  1 

-  ,222 

oc  4-1  1  a  -a  a 

1  1  -1 


which  is  indeed  the  scattering  matrix  of  the  node  without  stub  proposed  by  previous  authors  [2,3],  Hence,  the  general 
nature  of  the  proposed  node  is  demonstrated. 
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The  next  step  is  to  consider  the  TLM  algorithm  and  investigate  whether  a  reduction  of  the  number  of  operations  is 
possible.  By  manipulating  (12)-(15)  one  can  establish  the  following  relation: 


AlE<-> 

-ZyA 

TT(k)l 

xkix 

AIEf 

+  ZxA 

o(k) 

ytly 

+  ZyA 

u(k) 

x”x 

AIE^,^^ 

-Z,A 

„(k) 

yHy 

AIE^*'^ 

As  a  result,  only  field  values  at  the  center  of  the  cells  are  needed  and  can  be  calculated  from  incident  voltages  [4]  and 
adding  source  contributions  if  any.  For  instance,  by  using  (14)  and  (16): 


AlE^M.2^(a,+a3)  +  ;^(a2  +  a4) 


V(k) 


■  (k-l/2)  . 

(aj-aj)  -O.SV^x  and 


^  ■  ?.  [(a,  -0.5viyl  where  Yeq  =  ^ +Gez +2Y,  H-2yy  . 

i  +  RmyYxL 

To  achieve  the  construction  of  the  hybrid  node,  one  has  to  consider  the  TE  case.  The  above  relations  remain  valid 
if  one  makes  the  following  permutations: 

E  — >  H  ~E  5  1^  ^  ^  ^  M  ,  Jg  — ^  J ij]  — ^  “^e  j  ^  and  (2,  R)  (Y,  G) . 

VARIABLE  MESH 

As  mentioned  before,  the  advantage  of  the  hybrid  node  is  that  arbitrary  variation  of  the  mesh  size  within  the  same 
region  can  be  performed.  The  only  constraint  is  that  it  has  to  remain  structured,  i.e.,  every  cell  has  always  exactly  four 
neighbors  (see  figure  5).  This  requires  the  use  of  the  same  time  step  At  throughout  the  network.  The  selection  is  such  that 
numerical  stability  is  guaranteed,  i.e.,  >  0  for  all  nodes. 


^’^0  I 

A  I  A 

-  -  -  •  <--> 
V  z'J)  j  7(2)  V 


Figure  5  :  Irregular  structured  mesh 


Figure  6:  Transfer  of  impulses  between  cells  with  different  size. 


In  addition,  appropriate  transfer  of  impulse  voltages  towards  neighboring  nodes  is  applied  to  insure  that  continuity  of 
tangential  components  of  both  magnetic  and  electric  fields  is  enforced.  For  instance,  voltages  at  interface  (figure  6)  are 
given  by 

bi^^^Al  E^-Zx^Ay-Hy  and  b2  ^  =  Al-E^ +  Zx^Ay  Hy  (20) 

aJ^^Al  Ez+Z^^^Ay-Hy  and  a^^^  =  Al-E^ -Z^^^Ay -Hy  .  (21) 


where  Ez  and  Hy  are  the  tangential  components  at  the  interface  between  cell  1  and  2.  Hence,  transfer  in  x-direction  is 
made  as  follows  [5]; 


(4‘>1 

J  P  I'PI 

1 - 

'  » 

Q. 

( 

Q, 

+ 

1 

Lb?J 

where  the  reflection  coefficient  is  p  = 


7(2)  7(1) 


NUMERICAL  DISPERSION 

Consider  a  homogeneous  and  lossless  medium  for  which  s  =  SoSf  and  \x  =  Maxwell’s  equations  in 

local  form,  the  wave  equation  for  the  TE  case  can  be  easily  derived.  A  decomposition  in  a  plane  wave  spectrum  gives  a 
dispersion  relation  [6]: 

cop  (kx +ky-a)^ep)  =  0 

2  2  2 

where  o=27tf  and  k  =  [kx,ky3*  is  the  wave  vector.  The  case  ®  =  0  is  the  electrostatic  case  and  co  pe  =  kx+ky 
corresponds  to  traveling  waves.  Reflected  voltages  can  also  be  written  as: 

P'1) 

where  x  =  kxAx  ,  y  =  kyAy  and  (m,n)  are  the  node  central  discrete  coordinates.  For  neighboring  nodes,  voltage  transfer  is 
written  as: 

(k-l/2)At  _  r(k-i/2)At 

^m,n  “I'J  nm,n  (25) 

where  the  transfer  or  connection  matrix  [T].  Elements  are  zeros  except  Ti  3  =  Y,  T24  =  X,  T3]  =  Y  \  T42  =  X  T55  = 
l,with  X  =  e^’‘  and  Y  =  e^^ .  Finally,  (16)  becomes: 

(X[Pi]-tP2]“[T])“bo  =[M(l)]obo  =  0  (26) 

where  X  =  e”"*®  and  0  =  ooAt.  Dispersion  equation  is  obtained  by  setting  det([M(X)])=0.  One  can  show  (see  appendix  C) 
that  \  =  ±1  are  eigenvalues  of  first  and  second  order,  respectively.  The  eigensolution  associated  to  =  1  is  the  electrostatic 
solution  of  (23)  and  X  =  -\  corresponds  to  a  high-frequency  spurious  mode  (f  =  l/2At)  [3].  The  traveling-wave  equation 
can  be  written  as: 


(  ©At) 

i  21 

1  1 

fcmAtl 

fsin^f-1 

1  2  J 

^  Ax  ^ 

(2J 

1  Ay  J 

where  0<At<At^ax.  At^^^x  given  by  (19)  and  c^  is  the  medium  velocity.  Setting  e  =  c)^  =  2jr^,  s  = 
x  =  kxAx  and  y  =  kyAx ,  dispersion  relation  becomes; 

It  turns  out  that  (28)  has  the  same  form  as  for  the  FDTD  cell  [7].  Thus,  there  is  a  complete  equivalence  in  terms  of 
dispersion  performance  and,  also,  the  stability  limit  is  the  same  [8].  However,  the  algorithm  for  field  computation  is 
different. 

CONCLUSIONS  .  ^  f 

In  this  paper,  a  new  two-dimensional  TLM  node  is  presented.  It  is  constructed  from  the  approximation  of 
Maxwell- Ampere  and  Maxwell-Faraday  laws  in  integral  form.  The  final  node  is  hybrid  as  a  stub  is  connected  to  vary  the 
time-step.  This  feature  makes  it  flexible  as  non  regular  (but  structured)  mesh  can  be  used  with  arbi^ary  variation.  In 
addition,  the  formulation  is  complete  which  gives  to  the  node  a  general  form  as  constant  magnetic  and  dielectric  losses  are 


259 


accounted  for  and  magnetic  and  electric  source  current  densities  can  be  introduced.  Also  media  represented  by  (non- 
dispersive)  diagonal  tensors  can  be  modeled.  Finally,  it  is  found  that  at  maximum  time-step  value  the  proposed  node 
corresponds  to  the  rectangular  cell  with  arbitrary  aspect  ration  proposed  by  previous  papers. 

In  terms  of  dispersion,  theoretical  investigation  for  homogeneous  and  sourceless  medium  and  uniform  rectangular 
mesh  shows  that  the  2D-TLM  hybrid  node  has  the  same  dispersion  relation  as  in  the  case  of  the  2D-FDTD  node.  This 
equivalence  is  valid  for  arbitrary  time-step  value  within  the  stability  interval.  However,  if  performances  in  terms  of 
dispersion  are  identical,  the  corresponding  computational  algorithms  are  different  as  fields  are  not  directly  computed  with 
TLM  and  it  needs  more  variables  than  FDTD.  Finally,  optimization  techniques  are  implemented  to  accelerate  the  TLM 
algorithm.  For  instance,  computing  time  for  reflected  voltages  can  be  reduced  substantially  by  first  evaluating  field 
components  at  the  center  of  the  cell.  Then,  reflected  voltages  are  computed  by  a  reduced  number  of  operations  without 
using  the  scattering  matrix  as  it  is  usually  done  in  TLM. 
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APPENDIX  A  : 

Evaluation  of  current  density  fluxes  (see  (1)  and  (2))  can  be  achieved  by  taking  an  average  value  of  fields  ( Ei^  and 
)  at  the  center  of  S  (see  figure  A.l)  and  an  approximation  in  time.  Hence, 


JgJ  at  Zq  cAt  Ak 


(A.l) 


(T-1)  PkkAjAj 
CAt  Ak 


AkHk 


{A.2) 


in  which 


Fk  = 


1 

AiAj 


IjF.kds  with  F  =  EorH.  Now,  one  has  to  evaluate  an  average  value  of  the  normal  component  of  F 
S 


by  using  know'n  values  on  the  surface. 


Figure  A.l  -  Surface  integral  approximation 
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A  GENERAL  FORMULATION  OF  A  THREE  -  DIMENSIONAL  TLM  CONDENSED  NODE  WITH  THE 
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INTRODUCTION 

TLM  is  known  as  a  numerical  model  of  propagation  based  on  Huygens  principle.  The  formulation  is  based  on 
the  analogy'  between  electromagnetic  wave  propagation  and  electrical  signal  propagating  in  a  network  of  interconnected 
lines  or  guides.  However,  a  deduction  of  the  TLM  algorithm  directly  from  Maxwell'  equations  seems  more  appropriate. 
Indeed,  this  provides  to  a  better  understanding  of  the  method.  As  a  result,  the  possibility  to  combine  the  TLM  with  other 
numerical  techniques  based  on  field  formulation  is  enhanced.  For  instance,  efficient  boundary  conditions  have  to  be 
implemented  for  open  problems.  These  conditions  are  generally  formulated  in  terms  of  fields  which  cannot  be  directly 
transformed  in  electrical  quantities  for  TLM.  Recently,  Krumpholz  and  Russer  [1]  derived  the  TLM  scattering  matrix  for 
both  two  and  three-dimensional  cases  by  using  an  approach  based  on  moment  method.  Jin  and  Vahldieck  [2]  made  a 
similar  derivation  for  the  3D  basic  (SCN)  and  hybrid  (HSCN)  symmetrical  condensed  node.  Their  approach  is  based  on  a 
finite-difference  approximation  of  Maxwell’s  equations. 

In  this  paper,  a  general  3D  condensed  node  is  formulated  based  on  the  approximation  of  Maxwell-Ampere  and 
Ma.vivell-Faraday  equations  in  global  fonn.  This  new  contribution  consists  in  a  systematic  procedure  which  allows  a 
general  formulation  w  hich  includes  all  ty^aes  of  condensed  TLM  nodes  namely,  the  super  condensed  sy'mmetrical  node 
(SSCN)  [3],  the  HSCN  and  the  basic  SCN  proposed  by  Johns  [4].  Integral  form  of  Maxwell-Ampere  and  Maxwell- 
Faraday's  laws  produce  a  sufficient  number  of  independent  relations  in  terms  of  incident  and  reflected  voltages  at  nodes. 
Then,  a  general  scattering  matrix  describing  all  types  of  nodes  is  derived  as  well  as  expressions  for  fields  at  the  center  of  a 
parallelepiped  TLM  cell.  In  addition,  the  general  formulation  includes  electric  and  magnetic  losses  [9]  and  anisotropic 
media  as  long  as  these  cans  be  modeled  by  diagonal  (real)  tensors.  Finally,  the  node  accounts  for  insertion  of  magnetic  or 
electric  source  current  densities. 

MAXWELL’S  LAW  INTEGRAL  FORM  APPROXIMATION 

Let  us  start  with  the  two  first  Maxwell’s  laws  in  integral  form  in  the  case  of  media  which  can  be  lossy, 
anisotropic  (diagonal  real  tensor)  and  in  which  electric  or  magnetic  sources  can  exist. 

fHodT  =  |Je— ods  +  j|a,Eods  +  J|jeods  (1) 

r  s  ^  s  s 

f  E  O  dl  =  - Jf  A ^  o  ds  -  J/ a,H  0  ds  -  fj  J,  o  ds  (2) 

r  s  s  s 

where  J^  and  are  the  electric  and  magnetic  source  densities,  respectively  and  s  =  sodiag(exxiSyy.£zz), 
A  =  irodiag(Pxx'^yyP2z),  =  diag(aex,cJey,crez) ,  =  diag(cjn,x.Omy.o„iz)  are  isotropic  medium  constitutive 

parameter  tensors.  In  order  to  approximate  (1)  and  (2),  consider  the  parallelepiped  cell  of  dimensions  Ay  and  A^ 

illustrated  by  figure  1.  As  defined  in  [1],  tangential  field  components  are  sampled  at  the  center  of  the  faces  of  the  cell. 
Thus,  there  is  a  total  of  twelve  samples  for  both  fields  to  account  for  all  cases  of  polarization.  Electrical  samples  are 
numbered  according  to  the  SCN  model  presented  by  Johns  (4)  (see  figure  1).  Conveniently,  magnetic  field  samples  with 
corresponding  number  are  located  at  the  same  point  on  the  face  but  oriented  perpendicularly  with  respect  to  their 
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electrical  corespondent  (e.g.,  and  ).  In  addition,  either  electric  or  magnetic  field  samples  at  the  center  of  the  cell 
are  required  for  both  SCN  and  HSCN  [2]. 


Figurel:-Basic  cell  for  3D  TLM  Figure  2:  Maxwell-Ampere's  law  in  the  YOZ-plane 

First  of  all,  let  us  define  a  and  b  the  local  incident  and  reflected  voltage  vector  .  respectively,  according  to  the  definition 
used  in  [I]; 


where  components  for  the  various  case  are  defined  in  appendix  A.  Then,  (1)  is  evaluated  in  the  YOZ  according  to  the 
path  detailed  in  figure  2.  For  evaluation  of  (2),  the  plane  XOZ  is  used  (see  figure  1).  Without  loss  of  generality,  losses  and 
sources  are  not  considered  for  the  moment.  Line  integrals  are  evaluated  by  averaging  field  values  between  time  (k-l/2)At 
and  (k-t-l/2)At  and  assuming  them  constant  on  each  side  of  the  paths: 

f  H  o  dT  =  |(1  +  T)[A,(Hf2  -  Hf)  +  Ay (H^  -  H^)]  (4) 

f  E  0  d!  =  -(1  +  T)k(Ef  -  E|)  +  AyiEl  -  E^)]  (5) 

r  ^ 

where  T  is  the  delay  operator:  T  u^''^  =  and  fields  are  supposed  evaluated  at  time  (k+l/2)At. 

Approximation  of  the  right-hand  side  of  (1)  and  (2)Js  done^  in  [6]:  The  flux  is  computed  by  an  average  value 
of  the  fields  at  the  center  of  the  surface.  Hence,  for  the  SCN  E^  and  ,  need  five  field  values: 


4  +  Y„ 


4  +  Z,^. 


(6) 

(7) 


where  the  stub  normalized  admittance  and  impedance  are  defined  by  E13  and  Hfg  ,  respectively: 
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Hence,  we  obtained  six  relations  for  the  SCN,  HSCN  and  SSCN.  Thus,  18,  15  and  12  more  relations  are  needed  for  these 
nodes,  respectively.  Evaluating  fields  in  two  perpendicular  planes  and  at  the  center  of  the  cells  allow  one  to  obtain  more 
relations  between  incident  and  reflected  voltages.  For  instance,  path  Fj  and  Fj  illustrated  in  figure  3  are  considered  and 
field  y-component  at  the  center  of  the  cell  at  time  kAt  evaluated. 


The  line  integral  of  H  along  F  =  Fj  u  F2 ,  assuming  is  the  average  value  between  time  (k-1/2)  and  (k+1/2),  yields: 


|H,dT=Ay 


2HW-i(l  +  T)(Hy+Hy  ) 


(14) 


Using  the  same  assumptions: 

SiuS2 

for  SSCN.  For  the  SCN  and  the  HSCN,  one  has  to  change  Yq  into  YoYy .  Hence,  Maxwell-Ampere's  law  yields: 

Similarly,  Maxwell-Faraday's  law  yields: 

Using  both  laws  in  the  YOZ-plane  (figure  I)  yields: 

24,  hW  .  Y„[(b<“'«  -  bfk-''^)  + 

2  4,  eM  =  bl''^'')  +  b<'‘'^''^>  +  +  al"-'''! 

Equations  (16),  (18)  and  (17).  (19) ;  allows  one  to  write  two  more  relations: 

Yxz(bio  -b6)  + Y2x(b9  -b2)==Yx2;(aio-a5)  + Y2x(a9  -32) 
b3  -  b4  -  bg  +  bj  I  =  ”  83  +  ^4  +  ag  -  aj  1 


(15) 

(16) 

(17) 

(18) 

(19) 

(20) 
(21) 


In  [5],  equation  (20)  and  (21)  are  presented  as  continuity  equations  for  magnetic  and  electric  field  components, 
respectively.  At  this  stage  we  have  twelve  independent  relations  relating  incident  and  reflected  voltages.  This  is  sufficient 
for  the  SSCN,  while  the  HSCN  and  the  SCN  need  three  and  six  others  relations,  respectively.  This  can  be  done  by 
evaluating  field  components  at  the  center  of  the  cell.  Consider  for  instance  E^*'^  =  ,  one  can  use 

(17)  or  (19)  to  establish  an  other  relation. 
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To  summarize,  from  integral  fonn  of  Maxwell's  laws,  relations  between  incident  and  reflected  voltages  for  all 
condensed  t>-pe  of  TLM  nodes  are  found.  They  can  be  written  under  the  matrix  form: 

^22) 

in  which  matrices  [Pijand  [P2]  are  presented  in  appendix  B  for  the  lossy  case.  The  scattering  matrix  is  readily  found 
from  (22): 

[s]=[p.rV2]. 

Finally,  it  is  worth  to  mention  that  the  formulation  presented  by  Herring  and  Cristopoulos  [8]  is  based  on  the  magnetic 
flux  and  electric  charge  conserv'ation  and  field  continuity  from  transmission-line  theory. 

LOSSES  AND  SOURCE  MODELING 

Loss  modeling  in  TLM  consists  of  accounting  for  the  surface  integral  contribution  due  to  current  densities  in 
terms  of  incident  and  reflected  voltages.  This  can  be  approximated  by  considering  field  values  at  the  cell  center.  For 
instance,  the  integrals  on  S  illustrated  in  figure  2  can  be  approximated  as  follows: 

JJa^E  o  ds  =  (electric  losses)  (24) 


where,  in  general  form: 


(magnetic  losses) 


^  AjAj 

Gek-C?ek"7 —  and  Rmk  =<=fmk— ; —  k  =  X,  y,  z 

^k  Ak 


(25) 

(26) 


For  the  SCN  and  the  HSCN,  normalized  conductance  Ge^  and  resistance  are  used  [2J.  Terms  due  to  losses  are 
introduced  in  [Pi]et  [P2]  (see  appendix  B)  by  using  for  instance  e|^^  =  .  Hence,  (24)  becomes: 


|aeE.ds=^Gex(at3-''^^+biT>)  with  6,^  =  ZoG.^ 

Finally,  a  similar  approach  is  taken  for  contributions  due  to  source  integrals  which  transforms  (22)  as  follows: 

[Pj]b(k+0.5)^[p^]g(k-0.5)_jp^j5(k) 

where  u^*^^  in  the  source  vector. 


(27) 


SIMULATION  RESULTS 

In  order  to  validate  TLM  source  modeling,  a  waveguide  is  excited  by  an  electrical  current  sheet  with  sinusoidal 
spatial  distribution.  The  current  source  is  a  gaussian  pulse  amplitude  modulated  by  a  carrier  at  frequenc>'  fo .  Parameters 
are  adjusted  such  that  the  excitation  frequency  spectrum  is  above  waveguide  cutoff.  Constitutive  parameters  are 
=  1,  p[.  =  4  and  A1  /  ?,fnax  =  0-0^7  .  A  comparison  between  the  different  nodes  in  terms  of  time  response  in  shown  in 
figure  4.  Dispersion  introduced  by  the  TLM  model  can  be  observed  when  compared  with  the  theory.  No  significant 
difference  is  observed  between  the  different  node  modeling.  One  can  also  observ’e  the  physical  dispersion  introduced  by 
the  guide.  Finally,  figure  5  shows  the  dispersion  characteristics  of  a  microstrip  cavity  on  anisotropic  substrate. 
Comparison  is  made  between  the  distributed  TLM  node  (ND)  and  FDTD  [14],  SCN  and  SSCN.  It  is  obseiwed  that 
differences  are  increasing  with  frequency.  The  various  nodes  behave  differently  when  numerical  dispersion  becomes 
significant  (Al/?.  s  0.1  a  10  GHz).  Finally,  this  application  shows  the  applicability  of  the  SSCN  [3]  for  anisotropic  media. 
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Figure  4:  Numerical  dispersion  (waveguide  above  cutoS)  Figure  5:  Dispersion  for  shielded  microstrip  line  - 


CONCLUSION 

In  this  paper,  a  general  3D-TLM  formulation  including  all  condensed  nodes  SCN,  HSCN  and  SSCN)  and 
derived  directly  from  field  Maxwell's  law  in  integral  form  was  presented.  The  general  scattering  matrices  relating 
incident  and  reflected  voltages  at  every  node  account  for  electric  and  magnetic  losses  and  allows  the  insertion  of  electrical 
and  magnetic  source  current  densities. 

The  presented  algorithms  have  been  validated  through  electromagnetic  modeling  of  homogenous  and 
inhomogenous  structures.  In  addition,  current  source  modeling  capability  of  the  proposed  node  is  demonstrated.  Also, 
structure  including  anisotropic  media  was  analyzed  and  satisfactory  results  were  found.  In  addition,  comparisons  between 
all  three  types  of  condensed  nodes  namely,  the  SCN,  HSCN  and  SSCN  seem  to  confirm  a  slightly  better  performance  in 
terms  of  dispersion  for  the  SCN.  However,  other  criteria  must  be  accounted  for,  in  order  to  issue  any  conclusion  about  the 
overall  performance  of  the  nodes.  In  particular,  number  of  operations  required  per  iteration  and  the  computer  storage. 
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APPENDIX  A 

Incident  and  reflected  voltages  (see  eq,  (3))  for  the  SSCN  are  defined  as  follows: 


A^Ei  -  Zy^A^Hf 

'A,Ef  +  Zy^A,Hf  ■ 

AjjE2 

A,E2^-Z,,AyHy 

AyEj  +  ZxyA^H^ 

AyE^  -  Z^yA^Hg 

AyE4  -  ZzyAxH4 

AyE^  +  ZjyAj^H^ 

Az^S  ^yzAxHs 

AjEy  -  Zy^^AxHl 

A,E^-Z,,AyH^ 

b  -  ^ 

AzE|  +  Z,,AyHy 

A,Ef-Zy^A,H7^ 

^SSCN  -  - 

A,Ef+Zy^A,H^ 

AyE^  +  Z^yA^Hg" 

AyEg  -  Z^yAxHg 

A,E|-Z,,Ay,Hy 

AxE9  +Z2.xAyH9 

Az^lO  +  ^XzAyHj’g 

AzEio  “  ZxzAyHj'o 

AyEj^i  -  Z^yA^Hji 

AyE^l  +  Z^yA^Hfi 

A,Ef2+Zy,A,Hf2 

AxEf2  -ZyxAzHf2 

where  Zjj  with  i,  j  e{x,y,z}  and  i  j  are  the  line 

impedance  along  the  axis  i  and  polarized  in  the 
direction  j.[3].  Furthermore,  the  same  above 
definition  applies  for  the  SCN  by  setting  all 
impedances  to  Zq  .  For  the  HSCN 


=  -Zxy  =  Zyx .  Finally,  field  samples  at  the 

*2 


center  of  the  cell  are  accounted  by  vectors; 
VE=^[A,Ef3  AyEf,  A.EfjJand 

~  AyHjy  A^Hjjj  . 
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The  structure  of  [PjJand  [P2](see  eq.  (22))  is: 


where  [P;]  -  [PjJ  for  the  SSCN  are  given  below: 


n  IS  18 
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where  ±  j  ^ 

4*’ =  ±l  +  jRr*Yii. 

with  i,j,k  e{x.y,z},i9t  jjik  and 

(k,j)€{(x.y).(y,z),(z,x)j. 

For  the  SCN  same  matrices  apply,  but 
with: 

Yij  =  land  Gek=0  =  R^. 
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INTRODUCTION 

Time  domain  numerical  solutions  have  enjoyed  an  increase  in  popularity  over  the  past 
decade  as  more  powerful  and  affordable  computational  platforms  have  made  their 
application  in  practical  engineering  design  work  feasible.  The  ability  to  model  arbitrary 
geometric  detail  and  homogenous  regions  with  ease  enables  the  design  engineer  to  use  a 
single  time  domain  code  to  solve  a  very  wide  variety  of  electromagnetic  interaction 
problems.  Two  techniques  which  have  become  quite  prominent  in  time  domain 
electromagnetic  analysis  are  the  finite  difference  time  domain  (FDTD)  method  and  the 
transmission  line  matrix  (TLM)  method.  Both  of  the  methods  are  differential  equation 
based  time  domain  field  solvers  and  their  similarity  has  invited  numerous  comparison 
between  the  two.  Most  of  these  studies  have  been  of  a  theoretical  nature  comparing  the 
mathematical  equivalence  and  differences  between  the  various  formulations  of  the  two 
algorithms  [1, 2,3,4],  Advantages  and  disadvantages  to  each  of  the  two  methods  have 
been  touted  by  various  authors.  Some  of  these  studies  claim  that  FDTD  is  superior 
because  it  requires  fewer  computer  operations  per  nodal  update  and  less  memory  storage 
per  cell.  In  addition,  FDTD  does  not  suffer  from  spurious  solutions  that,  theoretically  at 
least,  can  corrupt  TLM  results.  On  the  other  hand,  TLM  practitioners  tend  to  cite  the 
superiority  of  TLM  due  to  less  numerical  dispersion  and  denser  spatial  field  sampling 
leading  to  finer  boundary  resolution  for  a  given  mesh  size.  To  date,  however,  no  thorough 
study  comparing  the  accuracy  and  efficiency  of  the  numerical  results  obtained  firom  TLM 
and  FDTD  when  applied  to  problems  in  identical  settings  has  been  published. 

In  this  paper  we  compare  the  performance  of  FDTD  and  TLM  in  terms  of  numerical 
dispersion  for  meshes  that  use  non-uniform  variable  grading.  Specifically,  we  examine  the 
error  in  the  propagation  constant  calculated  with  the  two  methods.  In  addition  we  will 
comment  briefly  on  the  impact  that  the  existence  of  spurious  field  solutions  in  condensed 
node  TLM  algorithms  has  in  practical  applications.  While  the  computational  resources  are 
a  very  important  part  in  the  comparison  of  any  numerical  techniques,  a  thorough  study  is 
beyond  the  scope  of  this  paper  and  consequently  will  be  addressed  only  briefly. 


NUMERICAL  METHODS 

We  will  assume  a  familiarity  with  the  basic  workings  of  both  the  FDTD  and  condensed 
node  TLM  algorithms.  Only  the  basic  features  unique  to  the  particular  analysis  codes  used 
in  this  study  will  be  discussed. 

The  TLM  program  used  here  is  based  on  the  hybrid  symmetrical  condensed  node  (HSCN) 
formulation  [5].  This  node  is  formulated  to  easily  permit  a  non-uniformly  graded  cartesian 
mesh  to  be  implemented.  As  with  all  condensed  node  TLM  algorithms,  all  six  vector  field 
components  are  located  at  cell  centers.  In  addition,  tangential  electric  and  magnetic  field 
components  are  available  on  the  cell  faces.  In  practical  engineering  problems  it  is  usually 
advantageous  to  operate  the  algorithm  at  the  largest  time  step  permissible.  This  practice 
has  been  followed  here  where  the  maximum  time  step  is  chosen  based  on  the  requirement 
that  all  impedances  in  the  condensed  node  mesh  remain  positive.  For  the  purposes  of 
terminating  the  numerical  simulation  space,  a  second  order  Higdon  absorbing  boundary 
condition  (ABC)  is  used. 

The  FDTD  program  used  in  this  study  was  derived  based  on  a  curvilinear  algorithm  to 
achieve  variable  mesh  grading.  The  nonuniform  orthogonal  implementation  [6]  maintains 
the  accuracy  of  the  curvilinear  algorithm  while  keeping  the  same  form  for  the  staggered 
grid  leap-frog  update  equations  as  used  in  the  standard  Yee  algorithm.  The  time  step  is 
chosen  to  satisfy  the  Courant  stability  condition  and  second  order  Higdon  ABCs  are  used 
to  truncate  the  computational  domain. 


RESULTS 

In  this  section  we  compute  the  error  in  the  propagation  constant  for  the  FDTD  and  TLM 
algorithms  in  waveguide  structures  which  are  discretized  using  both  uniformly  and  non- 
uniformly  graded  meshes.  The  error  in  propagation  constant  for  waves  propagating  in  the 
numerical  meshes  arises  from  the  numerical  dispersion  that  the  waves  experience  as  they 
travel  through  the  discretized  problem  space.  The  problem  of  dispersion  has  been  studied 
analyticaUy  quite  thoroughly  for  both  the  FDTD  [7]  and  TLM  [8]  algorithms.  Many 
analytic  investigations  are  limited  for  the  most  part  to  the  study  of  uniformly  graded 
meshes.  (By  uniform  grading  we  refer  to  a  mesh  in  which  the  spatial  discretizations  in  the 
X-,  y-  and  z-axis  directions  (Ax,  Ay,  Az)  can  differ,  but  do  not  change  throughout  the 
problem  space  resulting  in  non-cubic  cells  of  equal  dimensions.  On  the  other  hand,  non- 
uniform  grading  refers  to  those  case  where  Ax,  Ay  and  Az  may  change  from  cell  to  cell 
throughout  the  mesh). 

Two  types  of  waveguides  are  studied  here:  a  parallel  plate  waveguide  and  a  standard  WR- 
90  waveguide.  These  examples  are  by  no  means  intended  to  provide  a  thorough 
investigation  of  dispersion.  Rather,  our  intent  is  to  provide  some  insight  into  the  effects  of 
dispersion  in  meshes  used  for  modeling  a  very  limited  number  of  practical  waveguiding 
stmctures. 
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In  order  to  compute  the  numerical  propagation  constant  in  the  mesh  the  temporal 
signature  of  the  fields  is  saved  at  two  points  a  distance  of  L  apart  in  the  direction  of 
propagation  (the  z-direction  for  the  examples  presented  here).  Fourier  transform  of  these 
time  domain  fields  yields  the  phase  shift  between  the  points  as  a  function  of  frequency. 
Knowledge  of  the  phase  shift  and  distance  between  the  points  allows  the  calculation  of  the 
numerical  phase  constant,  Pn .  The  percent  error  in  the  propagation  constant  is  defined  as 

Ip  -p  I 

%  Error  In  Propagation  Constant  =  '  'xlOO 

where  Pt  is  the  theoretical  propagation  constant.  A  Gaussian  pulse  was  used  as  the 
excitation  waveform  in  all  cases. 

The  first  structure  examined  is  the  parallel  plate  waveguide  shown  in  Figure  1,  While  very 
simple,  this  structure  is  useful  for  simulating  normal  incidence  on  a  periodic  grid  of 
scatterers  or  antennas  for  array  analysis 
[10].  It  also  provides  a  very  convenient 
way  of  numerically  determining  mesh 
dispersion  along  a  coordinate  axis  (in  this 
case  the  z-axis)  for  various  mesh  grading 
schemes. 

The  parallel  plate  guide  is  meshed  with 
two  cells  in  the  x-direction  and  one  cell  in 
the  y-direction.  Note  that  the  number  of 
cells  is  irrelevant  in  these  directions  since 
there  is  no  field  variation.  The  scheme 
used  was  chosen  only  for  ease  of  boundary  definition  in  the  FDTD  code.  For  the  first 
example  the  guide  was  meshed  uniformly.  Figure  2  shows  the  propagation  constant  error 
for  six  different  graded  meshes  obtained  from  the  two  algorithms.  Note  that  the  x  axis  is 
plotted  in  terms  of  the  minimum  cell  size  in  the  mesh  divided  by  the  wavelength  where  the 
minimum  cell  size  is  1  in  all  cases.  In  the  axial  direction  it  is  well  known  that  a  uniformly 
graded  condensed  node  TLM  mesh  has  no  dispersion  and  this  is  readily  seen  in  Figure  2. 
As  the  cell  size  in  the  z-direction  is  increased  the  error  also  increases.  It  is  interesting  to 
note  that  for  TLM  the  error  is  a  function  only  of  the  grading  in  the  direction  of 
propagation,  while  for  FDTD  grading  the  cells  in  the  transverse  directions  has  a  slight 
effect  on  the  error  for  a  constant  cell  size  in  the  propagation  direction.  In  all  cases  the 
TLM  results  display  a  smaller  error  than  the  corresponding  FDTD  results.  It  must  be 
realized  that  this  conclusion  is  only  valid  for  waves  traveling  in  the  axial  direction.  The 
error  is  also  a  function  of  propagation  direction  which  is  not  evident  here.  However, 
when  using  a  parallel  plate  waveguide  for  simulating  periodic  stmctures  propagation  is  in 
the  axial  direction  only. 


Figure  1 .  Geometry  of  parallel  plate 
waveguide. 
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Figure  4.  Propagation  error  for  (a)  FDTD  and  (b)  TLM 
algorithms  for  the  case  of  constant  growth  rates 
in  cell  size  over  a  fixed  mesh  region. 


very  interesting  to  note,  however,  that 
the  propagation  error  for  the  TLM 
algorithm  is  substantially  less  than  for 
the  FDTD  algorithm.  In  fact,  the 
maximum  error  for  the  8  cell  case  with 
TLM  is  less  than  the  maximum  error 
for  the  all  but  the  15  cell  FDTD  case. 
For  the  8  cell  case  the  maximum 
“electrical  cell  size”  defined  as  AmaxAo 
is  0.2828  which  is  well  over  the 
usually  cited  maximum  electrical  cell 
size  of  0.10.  The  undulations  in  the 
curves  are  due  to  numerical  reflections 
from  the  absorbing  boundaries  and 
could  be  eliminated  by  the  use  of  a 
better  ABC. 

For  the  final  example  involving  the 
parallel  plate  waveguide,  the  effect  of 
an  abrupt  change  in  cell  size  in  the 
propagation  direction  is  investigated. 
Consider  Case  C  in  Table  1  where  the 
maximum  cell  size  is  2.015.  This 
maximum  cell  size  is  attained  by 
gradually  increasing  the  cell  size  by  a 
constant  growth  ration.  To  see  the 
effects  of  an  abrupt  change  in  cell  size, 
we  have  run  the  case  where  the  cell 


size  changes  from  1  to  2.015  at  the  beginning  of  the  growth  region  and  is  maintained  to 
the  far  end  of  the  mesh.  The  results  obtained  from  the  FDTD  and  TLM  algorithms  for  the 
propagation  error  are  shown  in  Figure  5.  It  is  observed  that  an  abrupt  change  results  in 
larger  errors  than  when  the 
maximum  cell  size  is  reached 
gradually.  Once  again  it  is 
interesting  to  note  that  the  TLM 
error  is  significantly  less  than  the 
FDTD  error  in  both  cases.  In 
fact,  the  TLM  error  for  the 
abrupt  change  in  ceU  size  is  less 
than  the  FDTD  error  for  a 
gradual  change  in  cell  size. 


All  of  the  results  thus  far  have 
been  for  waves  propagating  in  a 
direction  parallel  to  a  mesh 
coordinate  axis  in  a  parallel  plate 
waveguide.  Now,  consider 
propagation  in  a  rectangular 
waveguide  operating  in  the  TEio 
mode.  In  this  situation  the 
waves  propagate  at  an  angle  to 
the  mesh  coordinate  axis  and  this 
angle  depends  on  the  frequency 
of  operation.  For  the  following 
results  a  standard  WR-90 
waveguide  has  been  used. 


Figure  5.  Propagation  error  for  (a)  FDTD  and  (b) 
TLM  algorithms  for  gradual  and  abrupt 
changes  in  cell  size. 


First  a  section  of  WR-90  with  a 
uniform  mesh  grading  will  be 

considered.  Figure  6  shows  the  propagation  error  for  two  uniformly  graded  meshes.  The 
trends  are  the  same  as  those  for  the  parallel  plate  guide,  i.e.,  “stretching”  the  mesh  in  the 
direction  of  propagation  results  in  a  larger  propagation  error  and  the  TLM  error  is  less 
than  the  FDTD  error.  An  interesting  point  to  note  is  that  for  the  stretched  TLM  mesh 
there  is  actually  less  propagation  error  below  10  GHz  than  for  the  ungraded  mesh.  This  is 
likely  due  to  the  dependence  of  numerical  dispersion  on  the  propagation  angle  (measured 
away  from  the  propagation  direction)  which  varies  from  61°  at  7.5  GHz  to  26°  at  15  GHz. 


As  a  final  example  we  consider  the  propagation  error  associated  with  a  numerical  mesh 
constructed  to  solve  a  “real”  problem.  The  problem  is  a  resonant  waveguide  iris  in  WR- 
90  waveguide  and  is  shown  in  Figure  7.  The  details  for  the  non-uniformly  graded  mesh 
used  in  the  simulation  are  given  in  Table  2.  The  mesh  was  constructed  such  that  fine  cells 
were  used  in  the  vacinity  of  the  infinitesimally  thin  iris  edges  and  increased  in  size  away 
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from  these  regions  of  large  field  gradients.  The  maximum  cell  size  occurs  in  the  direction 
of  propagation  and  is  six  times  larger  than  the  smallest  cell  dimension. 


The  sample  field  points  chosen  for 
the  calculation  of  the  propagation 
error  were  located  at  the  input  and 
output  ports  of  the  device.  These 
were  the  same  points  that  would  have 
been  chosen  for  the  calculation  of  the 
S-parameters  of  the  iris.  The 
simulations  were  run  without  the  iris 
present  in  the  numerical  models  in 
order  to  quantify  the  propagation 
error  associated  with  waves  traveling 
through  the  highly  non-uniform 
graded  mesh.  The  resulting 
propagation  error  is  shown  in  Figure 
8.  The  “noise”  present  in  the  data  is 
due  to  the  sensitivity  of  the 
propagation  error  to  the  ABCs  used. 
The  ABC  parameters  were  adjusted 
to  optimize  absorption  at  various 
angles  of  incidence  and  these  changes 
also  changed  the  ripple  pattern  in  the 
data.  The  overall  trend  of  the  error 
levels  remained  consistent  however. 
Use  of  a  higher  order  ABC  is  likely  to 
algorithms  for  WR-90  waveguide  with  uniformly  improve  the  appearance  of  this  data. 

For  the  present  simulations  identical 
ABC  parameters  were  used  for  the 
FDTD  and  TLM  simulations.  This 
result  is  actually  quite  interesting  in 
the  sense  that  the  propagation  error 
has  been  calculated  on  a  realistic  mesh 
that  is  used  to  solve  a  practical 
waveguide  discontinuity  problem.  By 
computing  the  propagation  error 
accumulated  by  the  waves  propagating 
in  the  mesh  from  the  input  to  the 
output  point,  it  may  be  possible  to 
determine  an  accuracy  bound  on  the 
results.  Alternatively,  for  a  required 
phase  accuracy,  it  may  be  possible  to 
determine  the  maximum  permissible  cell  size  that  the  mesh  can  be  expanded  to.  The  data 
for  propagation  error  computed  here  would  seem  to  indicate  that  a  larger  cell  size  can  be 


graded  meshes.  The  curves  are  defined  as  (a) 
Az=Ax=Ay  and  (b)  Az=2Ax=2Ay. 


jFigure  7.  WR-90  waveguide  with  resonant  iris.  Iris 
aperture  dimensions  are  0.65x0.14  inches. 


(b) 

Figure  6.  Propagation  error  for  (a)  FDTD  and  (b)  TLM 
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used  for  a  given  accuracy  using  the  TLM  method.  However,  there  are  many  other  factors 
to  be  considered  before  any  such  conclusions  can  be  drawn  from  any  of  the  data  presented 
here.  These  and  other  key  considerations  are  discussed  in  the  following  section. 


TABLE  2 
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Figure  8.  Propngation  error  for  waveguide  iri.s 
inesli. 


CONCLUSIONS 

In  this  paper  we  have  presented  a  comparison  of  the  propagation  error  associated  with  the 
FDTD  and  TLM  methods  when  applied  to  practical  waveguiding  problems.  While  many 
studies  have  been  performed  to  analytically  characterize  numerical  dispersion  in  both 
techniques,  very  little  work  has  been  done  to  quantify  the  error  in  non-uniform  graded 
meshes.  While  the  results  seen  here  are  very  interesting,  one  must  exercise  extreme 
caution  before  drawing  conclusions  from  this  very  limited  data  set. 

For  all  cases  considered  here  the  TLM  method  has  displayed  smaller  propagation  errors 
than  the  FDTD  method.  This  would  seem  to  indicate  that  when  applying  TLM  a  coarser 
mesh  could  be  used  for  a  given  level  of  accuracy.  However,  there  are  many  other  factors 
that  must  be  considered  before  drawing  such  rash  conclusions.  It  is  well  recognized  that 
condensed  node  TLM  schemes  require  more  storage  per  node  than  Yee’s  FDTD  scheme. 
Further,  the  number  of  operations  to  update  each  node  is  greater  for  TLM.  Whether  or 
not  this  larger  requirement  of  computational  resources  can  be  offset  by  the  use  of  a 
coarser  non-unifonn  mesh  requires  further  investigation.  Another  area  that  hasn’t  been 
considered  here  concerns  the  description  of  boundaries  and  fine  features  within  the  mesh. 
Any  advantages  gained  via  the  ability  to  use  a  coarse  mesh  to  obtain  a  given  level  of 
propagation  error  is  rendered  useless  if  that  mesh  is  too  coarse  to  model  the  fine  features 
of  the  structure  under  investigation.  The  development  of  “smart  cells”  which  allow  fine 
features  to  be  modeled  in  a  relatively  coarse  mesh  is  crucial  in  order  to  be  able  to  exploit 
non-uniform  meshing  to  its  fullest  potential  in  both  methods. 
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Finally,  there  has  been  quite  a  bit  of  discussion  recently  concerning  the  existence  of 
spurious  modes  in  condensed  node  TLM  schemes  [8].  These  are  non-physical  solutions 
that  have  the  potential  to  corrupt  the  actual  solution.  While  there  is  little  doubt  from  a 
theoretical  standpoint  that  these  spurious  modes  modes  can  exist,  their  impact  on 
solutions  to  practical  problems  is  questionable. 
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Abstract 

The  accuracy  of  the  frequency-domain  TLM  (FDTLM)  simulation  is  directly  related  to  the  dis¬ 
persion  error  produced  by  the  FDTLM  nodes.  This  paper  presents  a  direct  comparison  between  the 
various  FDTLM  nodes  and  their  dispersion  characteristics.  A  class  of  FDTLM  symmetrical  con¬ 
densed  nodes  (SCNs)  have  been  characterized  in  terms  of  rearranging  the  order  of  the  node  ports  md 
determining  both  the  characteristic  admittance  and  the  propagation  constants  of  the  link  transmission 
lines.  The  dispersion  characteristics  of  these  FDTLM  SCNs  are  analyzed  by  the  numerical  solution  of 
an  eigenvalue  equation. 

Introduction 

The  frequency-domain  TLM  (FDTLM)  method  has  been  developed  recently  as  a  frequency- 
domain  counterpart  to  the  time-domain  TLM  (TDTLM)  method  [1-2].  In  contrast  to  the  TDTLM,  the 
FDTLM  works  entirely  in  the  frequency  domain,  that  is  no  time  iterations  are  involved  and  thus  no 
Fourier  transform  is  required  to  determine  the  frequency  response  of  the  simulated  circuits.  The 
“workhorse”  for  the  FDTLM  discretization  of  a  circuit  is  the  symmetrical  condensed  node  (SCN), 
which  was  first  introduced  by  Jin,  Vahldieck  and  Huang  (i.e.  [1-3])  and  the  formulation  was  based  on 
P.  B.  Johns’  SCN  [4],  The  FDTLM  SCN  is  essentially  an  intersection  of  six  transmission  lines  with 
certain  propagation  constants  and  characteristic  impedances  illustrated  in  Fig.  1.  First,  the  12-port 
symbolic  scattering  matrix  is  found  at  the  center  if  the  node  from  the  power  conservation  law.  Then, 
the  scattering  matrix  at  the  ports  of  the  node  is  obtained  by  shifting  the  reference  planes  from  the  cen¬ 
ter  to  the  ports  of  the  nodes.  To  do  so,  the  propagation  characteristics  of  the  link  lines  must  be  deter¬ 
mined.  There  are  several  approaches  to  accomplish  this  and  each  of  the  approaches  lead  to  a  different 
port  scattering  matrix.  Huang  has  investigated  the  following  possibilities  [5]:  The  propagation  con¬ 
stant  is  the  same  on  all  link  lines  and  the  characteristic  admittances  are  calculated  accordingly  (Prop¬ 
agation  Constant  Node)',  the  characteristic  impedance  is  the  same  on  all  link  lines  and  the  propagation 
constant  is  calculated  accordingly  (Characteristic  Admittance  Node)',  the  SCN  was  considered  a 
hybrid  node  represented  by  three  series  and  shunt  nodes  coupled  at  the  center  of  the  nodal  point, 
whereby  the  link  lines  may  be  considered  as  either  series  or  shunt  lines.  In  [5]  the  link  lines  were  only 
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considered  as  series  lines.  By  defining  the  characteristic  impedances  on  the  three  sets  of  series  link 
lines,  the  propagation  constants  can  be  calculated  {Hybrid  Node  with  Series  Decomposition).  To  com¬ 
plete  this  investigation,  the  hybrid  SCN  with  shunt  link  lines  was  considered  in  [6]  {Hybrid  Node 
with  Shunt  Decomposition).  In  the  present  paper  a  brief  review  of  the  different  node  representations 
will  be  provided.  Since  the  accuracy  of  the  FDTLM  simulation  is  directly  related  to  the  dispersion 
error  produced  by  the  FDTLM  nodes  a  direct  comparison  will  be  given  between  the  various  FDTLM 
nodes  and  their  dispersion  characteristic,  in  particular  for  the  graded  mesh  and  high  permittivity 
materials.  Before  we  begin  it  should  be  noted  that  both  hybrid  frequency  domain  nodes  {Hybrid 
Node  with  Series  Decomposition  and  Hybrid  Node  with  Shunt  Decomposition)  are  different  from  the 
hybrid  nodes  in  the  time-domain.  In  the  latter  the  characteristic  impedances  or  admittances  of  the  link 
lines  are  real  quantities  and  are  varied  to  account  for  a  graded  mesh  as  well  as  to  model  the  property 
of  the  medium.  To  maintain  time-synchronism,  shunt-circuited  open  stubs  are  needed  throughout  the 
mesh  if  an  inhomogeneous  propagation  medium  is  modelled  or  if  losses  need  to  be  accounted  for. 
The  resulting  symbolic  scattering  matrix  in  the  time-domain  hybrid  node  contains  only  real  elements 
and  its  size  can  vary  from  12x12  (lossless  homogeneous  case)  to  18x18  (lossy  electric  and  magnetic 
material).  In  the  frequency-domain  hybrid  node  (as  in  all  FDTLM  nodes),  time  synchronism  need  not 
be  preserved  and  thus  stubs  are  eliminated.The  electric  and  magnetic  losses  are  included  in  the  vari¬ 
ous  characteristic  impedances  and  propagation  constants  of  the  link  lines  which  become  now  com¬ 
plex  quantities.  However,  the  size  of  the  scattering  matrix  is  always  12  x  12. 

Theory 

The  equivalent  symmetrical  condensed  FDTLM  node  is  sketched  in  Fig.l.  In  the  frequency- 
domain  TLM,  time-harmonic  variations  are  assumed  and  hence  only  the  discretization  of  space  is 
required.  Incident  voltage  waves  on  each  port  are  scattered  to  couple  into  other  ports.  By  following 
the  equivalences  of  the  voltage  and  current  to  the  electric  and  magnetic  fields,  the  differential  equa¬ 
tions  that  described  the  behavior  of  voltages  and  currents  at  a  symmetrical  condensed  FDTLM  node 
are  given  in  the  form  as  shown  below. 
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where  u,  v  and  w  are  the  dimensions  of  the  node  in  x,  y,  and  z  directions,  respectively.  The  difference 
to  reference  Il~2][5]  and  this  paper  is  that  here  we  have  implemented  the  node’s  notation  proposed 
by  Krumpholz  and  Russer  [7]  and  a  symmetric  scattering  matrix  is  obtained,  which  lead  to  a  Hermi- 
tian  matrix  in  the  case  of  lossless  media.  The  nodal  inductances  and  capacitances  of  the  TLM 
branches  are  calculated  from  the  relationship  for  the  distributed  inductances  and  capacitances  of  the 
link  lines  in  all  six  polarized  directions  as  follows: 
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Fig.  1  The  equivalent  symmetrical  condensed  FDTLM  node. 

The  scattering  matrix  of  the  frequency- domain  symmetrical  condensed  node  can  be  presented  in  the 
following  form  with  compact  blocks  and  symmetry  on  the  main  diagonal  [6]: 
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The  normalized  characteristic  impedances  of  the  link  transmission  lines  are  associated  with  the  field 
polarization  and  defined  as  follows: 


Matrix  (3)  is  symmetric  and,  generally,  the  elements  are  complex.  In  case  of  a  lossless  medium,  the 
matrix  becomes  Hermitian. 

The  next  step  in  the  development  is  to  shift  the  symbolic  scattering  matrix  from  the  center  of  the 
node  to  the  ports.  This  is  possible  only  after  the  characteristic  impedances  and  the  propagation  con¬ 
stants  of  the  link  lines  are  determined  [5~6]: 

1.  The  Characteristic  Admittance  Node:  In  this  FDTLM  node  the  characteristic  admittance  of 
each  link  transmission  line  is  assumed  to  be  the  same  and  equal  to  the  intrinsic  admittance  of  the 
medium.  The  normalized  propagation  constants  on  each  branch  can  then  be  computed  from  the 
node’s  dimensions  as 

k,  2{  vw 
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In  general,  these  propagation  constants  are  complex  when  modelling  the  permittivity  and  permeabil¬ 
ity  in  lossy  media. 

2.  The  Propagation  Constant  Node:  In  contrast  to  the  previous  approach,  the  characteristic 
admittances  of  the  link  lines  are  different  and  have  been  selected  as  follows: 


If  the  characteristic  admittances  are  so  selected,  the  propagation  constants  along  all  12  link  lines  are 
the  same  and  equal  to  1/2  of  the  intrinsic  propagation  constant  ko  of  the  media.  The  disadvantage  of 
this  node  is  that  due  to  the  same  propagation  constant  a  graded  mesh  cannot  be  utilized. 


3.  The  Hybrid  Node:  In  this  case  the  SCN  is  considered  as  three  shunt  nodes  and  three  series 
nodes  coupled  at  the  center  of  the  node.  In  the  hybrid  node  with  series  decomposition  three  character¬ 
istic  admittances  are  chosen  which  are  related  to  the  inductances  of  the  series  link  lines,  i.e., 
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The  normalized  propagation  constants  of  all  link  lines  are  then  given  as 


Try  SU 

Try"  jk  ~  2,2 

^0  «  +  V 


-  _  .yy 


siu^  +  v'^) 
.  7„  w(«^  +  V^-5^) 

5{«2  +  v2) 


*0  5(«'  +  v') 

Tzv  - 


On  the  other  hand,  in  the  hybrid  node  with  shunt  decomposition  three  characteristic  admittances  of 
the  corresponding  shunt  transmission  lines  are  chosen,  which  are  related  to  the  capacitance  of  the  link 
lines,  i.e., 

V  =  y  =  fi  ■  y  =  y  =  H  •  y..  =  K  .=  ^  S  ;  O) 

where  j  =  7  ( +  v^w^)  /(u^  +  v^+  w^)  ;  u,  v  and  w  are  the  dimensions  of  the  node  in  x,  y,  and  z 
directions,  respectively.  The  normalized  propagation  constants  of  the  corresponding  link  transmission 
lines  can  be  derived  as: 
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According  to  Floquet’s  theorem  applied  to  a  three-dimensional  TLM  mesh,  the  dispersion  char¬ 
acteristics  for  the  frequency-domain  TLM  nodes  are  based  on  the  numerical  solution  of  an  eigenvalue 
equation  which  is  an  implicit  function  of  the  propagation  constant  kfJ^  of  the  wave  travelling  in  the 
medium  and  the  mesh  propagation  constants  k^,  ky  and  k^  along  the  y,  and  z  directions.  The  eigen¬ 
value  equation  can  be  expressed  in  the  form  [8~10]: 

det(PSp-!)=Q 

where  /isal2xl2  unit  matrix,  P  is  a  matrix  containing  the  mesh  propagation  constants,  and  Sp  is  the 
scattering  matrix  of  the  frequency-domain  TLM  nodes  which  are  related  to  the  ports  of  the  twelve 
link  transmission  lines.  It  should  be  noted  that  in  contrast  to  [8~9,]  matrix  T  no  longer  exists  in  equ. 
(11)  because  the  matrix  elements  are  incorporated  into  Sp,  the  scattering  matrix  at  the  ports  of  the 
node. 

Results  and  Discussions 

To  verify  the  validity  and  accuracy  of  the  class  of  FDTLM  nodes,  the  dispersion  characteristics 
of  a  suspended  stripline  is  investigated  and  the  results  are  shown  in  Fig.  2.  The  number  of  nodes 
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along  the  x-axis  is  chosen  as  8  and  along  the  y-axis  as  7.  A  graded  mesh  arrangement  is  used.  Com¬ 
pared  with  other  FDTLM  nodes,  it  can  be  seen  that  the  results  from  the  frequency-domain  hybrid 
node  converges  better  to  the  results  of  the  spectral  domain  approach  (SDA)  [5].  The  error  can  be  fur¬ 
ther  reduced  by  refining  the  mesh  discretization.  The  flexibility  of  the  frequency- domain  hybrid  node 
is  further  demonstrated  by  calculating  the  S-parameters  of  some  microstrip  discontinuities  [6]. 

The  dispersion  relations  of  the  FDTLM  nodes  can  be  obtained  from  the  general  equation  (11). 
Fig.  3  illustrates  the  relative  dispersion  errors  of  the  propagation  constant  node  and  characteristic 
admittance  node  [5]  compared  with  those  of  the  hybrid  node  [6]  for  a  plane  wave  propagation  in  the 
y-z  plane.  The  x-axis  represents  the  angle  between  the  propagation  vector  and  the  y-direction  is  given 
as  ^=arctan(kyA-}.  The  y-axis  represents  the  relative  propagation  error  given  as  (l/:l  -  where 

1^:1  is  written  as  IA:i  =  {k^  4-  and  is  the  propagation  constant  of  the  medium.  The  results  are 
given  for  a  graded  mesh  where  k^^u  =  k^^v  =  O.Itu,  0.2:i  and  8^  =  =  1.  The  factor  kgd  is 

directly  proportional  to  the  node’s  spacing,  because  the  FDTLM  mesh  usually  corresponds  to  fre¬ 
quencies  whose  wavelength  corresponds  to  1/10  of  a  FDTLM  cell.  It  can  be  seen  from  Fig.  3  that  the 
relative  dispersion  error  for  the  frequency-domain  hybrid  node  is  always  positive  and  much  less  than 
those  of  the  other  nodes. 

In  the  next  example,  we  investigate  the  dispersion  effect  in  a  dielectric  medium  with  a  graded 
mesh.  Fig.  4.  illustrates  a  family  of  curves  representing  plane  wave  propagating  in  a  graded  mesh  in 
the  y-z  propagation  plane.  The  permittivity  of  the  dielectric  medium  is  =  12.  The  normalized  prop¬ 
agation  vector  is  plotted  in  a  polar  representation  for  the  propagation  constant  node  and  the  character¬ 
istic  admittance  node.  The  shunt  and  series  decomposition  nodes  are  not  shown,  because  their 
dispersion  is  too  close  to  the  unit  circle.  The  vector  describes  the  unit  circle  for  the  infinitesimal 
mesh  case,  indicating  no  dispersion  occurs  for  the  FDTLM  nodes  at  low  frequencies,  even  for  a 
graded  mesh.  When  a  coarse  discretization  is  selected,  for  example  =  k^v  =  O.OSti,  0.1  jt, 
the  wavelength  can  not  be  considered  very  large  compared  to  the  graded  cell,  and  the  phase  velocity 
becomes  frequency  dispersive  and  depends  on  the  propagation  direction.  The  maximum  dispersion 
occurs  in  the  axial  direction  for  the  characteristic  admittance  node.  In  this  example  the  parameter  w  is 
larger  than  v;  and  the  dispersion  depends  on  the  node’s  dimension  along  the  propagation  directions. 
Thus,  the  dispersion  is  higher  in  z-direction.  A  comparison  of  the  relative  dispersion  errors  in  the  dif¬ 
ferent  graded  FDTLM  nodes  in  the  y-z  propagation  plane  is  shown  in  Fig.  5.  The  highest  dispersion 
occurs  in  the  characteristic  admittance  node.  The  dispersion  error  is  smaller  for  the  hybrid  node  than 
for  both  the  propagation  constant  node  and  the  characteristic  admittance  node  solutions,  for  all  angles 
of  propagation  in  the  y-z  plane.  This  better  accuracy  and  lower  dielectric  propagation  error  for  the 
hybrid  node  compared  to  other  nodes  may  be  due  to  the  fact  that  the  link  lines  take  into  account  the 
line  capacitance  rather  than  the  link  line  inductances  as  in  the  series  decomposition  [5].  The  disper¬ 
sion  analysis  is  also  consistent  with  the  results  shown  in  Fig.  2. 
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Conclusion 


In  this  paper,  a  class  of  FDTLM  SCNs  has  been  reviewed  and  compared  in  terms  of  their  accu¬ 
racy.  Numerical  results  show  that  the  hybrid  FDTLM  node  shows  much  less  dispersion  than  other 
FDTLM  nodes  and  that  the  shunt  decomposition  scheme  is  the  most  accurate  of  all. 
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Fig.  2  The  dispersion  characteristics  of  a  suspended 
stripline  obtained  with  the  different  ftequency- 
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Fig.  4  Dispersion  characteristics  of  a  graded  mesh  of  dif¬ 
ferent  FD-TLM  nodes  in  the  y-z  propagation  plane 
(dielectric),  k^u  =  k^v  -  0.05;:,  k^w  =  O.Itt,  = 
12,  and  =  1.  where  k^  is  the  medium  propaga¬ 
tion  constant. 
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Fig.  3  Comparison  of  the  relative  dispersion  errors  of  the 
different  graded  FD-TLM  nodes  in  the  y-z  propa¬ 
gation  plane  (free  space),  k^u  =  k^v  =  O.ln,  k^w 
=  0.2jr  and  =  p.,.  =  1,  where  k^  is  the  free  space 
propagation  constant. 


0  (degree) 

Fig.  5  Comparison  of  the  relative  dispersion  errors  of  the 
different  graded  FD-TLM  nodes  in  the  y-z  propa¬ 
gation  plane  (dielectric),  k^u  =  k^v  =  O.OStc,  k^w  = 
0.1  Jt,  £^=  12,  and  =  1,  where  is  the  medium 
propagation  constant. 


286 


Distributed  Simulation  of  Planar  Circuits 
by  TLM  Method  in  a  Parallel  Computing 
Environment 

B.  Isele,  J.  Schmoller,  P.  Russer 


Lehrstuhl  fiir  Hochfrequenztechnik.  Technische  Universitat  Miinchen,  Arcisstr.  21, 
D-80333  Miinchen  2,  Germany,  phone  +49(0)89/21058390,  fax  +49(0)89/21053365 

Abstract 

The  time-domain  simulation  of  coplanar  waveguide  (CPW)  elements  for  picosecond  pulse 
applications  is  described.  The  circuit  characteristics  of  the  CPW  elements  were  simulated 
in  time  domain  using  the  3D  transmission-line-matrix  (TLM)  method.  We  present  a  cost- 
effective  approach  for  the  time— domain  simulation  of  coplanar  circuit  structures  utilizing 
distributed  computing  within  a  parallel  software  environment.  The  application  of  TLM 
method  and  distributed  computing  to  the  efficient  analysis  of  several  coplanar  structures 
is  presented. 


Introduction 

The  present  work  arose  from  the  requirement  of  an  accurate  analysis  of  planar  circuits  for 
monolithic  microwave  integrated  circuits  (MMICs).  Time-domain  modeling  enables  a  di¬ 
rect  broadband  simulation  utilizing  arbitrary  excitation  signals,  which  may  be  useful  in  the 
investigation  of  circuits  processing  digital  signals.  In  recent  years,  the  transmission  line  ma¬ 
trix  (TLM)  method  has  emerged  as  a  powerful  tool  to  solve  electromagnetic  field  problems 
in  the  time  domain  [1,  2].  Recently,  efforts  have  been  made  to  improve  the  accuracy  and 
efficiency  of  the  TLM  schemes  based  on  symmetrical  condensed  nodes  [3].  The  occurence 
of  spurious  modes  caused  by  unphysical  excitations  of  TLM  meshes  can  be  minimized  using 
methods  given  in  [4].  In  particular,  the  modeling  of  planar  circuits  with  several  interfering 
discontinuities  requires  a  TLM  mesh  with  a  large  number  of  nodes.  Thus  the  simulation 
of  complex  microwave  circuits  is  limited  by  the  accessible  computer  memory  and  computer 
performance.  Parallel  and  distributed  computing  provides  a  cost  effective  way  to  overcome 
these  limitations.  Since  the  TLM  algorithm  is  based  on  local  operations  at  neighbouring 
mesh  nodes  only,  the  TLM  method  is  predestinated  for  parallel  computing.  Distributed 
computing  makes  best  use  of  the  available  resources  and  features  the  re-use  of  existing 
software  modules  if  high-level  parallel  programming  interfaces  are  provided.  As  a  parallel 
computing  environment  for  distributed  computing  we  use  the  Parallel  Virtual  Maschine 
(PVM)  V3.X,  which  became  a  standard  environment  for  parallel  computation  [5,  6,  7]  and 
which  is  largely  compatible  to  the  up-coming  (MPI-)  Message-Passing-Standard  for  parallel 
computation.  PVM  supplies  a  platform  for  portable  parallel  programming  and  provides 
a  reliable  communication  interface  also  in  heterogeneous  computing  environments.  Load 
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balancing  is  achieved  by  an  optimum  segmentation  of  the  mesh  based  on  an  initial  guess 
of  the  available  computer  power  at  the  particular  maschines  before  starting  the  simulation 
task.  The  segmentation  is  optimized  also  for  minimum  communication  traffic  between  the 
computing  nodes.  We  are  employing  clusters  of  interconnected  HP-7xx  Workstations  as 
well  as  an  IBM-SP2  supercomputer  with  up  to  56  computing  nodes. 


Simulation  in  a  parallel  computing  enironment 

In  general,  solvers  for  partial  differental  equations  (PDEs)  are  candidates  for  parallel  com¬ 
puting.  In  particular,  methods  yielding  decoupled  systems  of  equations,  are  suited  well  for 
parallelization.  This  implies  that  the  simulation  task  can  be  divided  into  smaller  subtasks, 
without  introducing  much  communication  effort  between  these  subtasks.  Since  the  TLM 
algorithm  is  based  on  local  operations  at  neighbouring  mesh  nodes  only,  the  TLM  method 
is  predestinated  for  parallel  computing. 

Distributed  computing  makes  best  use  of  the  available  resources  and  features  the  re-use 
of  existing  software  modules  if  high-level  parallel  programming  interfaces  are  provided.  As 
a  parallel  computing  environment  for  distributed  computing  we  use  the  Parallel  Virtual 
Machine  (PVM),  which  became  a  standard  environment  for  parallel  computation  [5,  6,  7] 
and  is  largely  compatible  to  the  up-coming  (MPI-)Message-Passing-Standard  for  parallel 
computation.  PVM  supplies  a  platform  for  portable  parallel  programming  and  provides  a 
reliable  communication  interface  also  in  heterogeneous  computing  environments.  Within 
PVM  the  processing  units  interchange  their  information  by  message  passing  techniques, 
which  can  be  implemented  easily  into  TLM.  Some  focus  lies  upon  the  granularity  of  the 
parallelization  problem.  Fine  granularity  involves  considerable  communication  overhead, 
decreasing  the  computational  power,  whereas  a  coarse  granularity  requests  computing  nodes 
of  higher  computational  power.  In  TLM  a  strict  synchronization  pattern  is  given  by  the 
TLM  timestep,  since  each  client  has  to  wait  for  its  adjacent  neighbors  for  completion  of 
one  TLM  iteration.  The  time  needed  for  the  communication  between  all  adjacent  clients 
is  considerably  larger  than  the  accumulated  communication  time  between  each  two  clients 
operating  on  adjacent  submeshes.  The  speedup  for  parallel  computation  is  usually  defined 
as  ratio  between  the  execution  time  on  a  standalone  computer  and  the  execution  time  in  a 
parallel  or  distributed  environment 

-I 

In  multi-user-environments  with  computing  nodes  doing  many  other  jobs  concurrently,  it 
is  senseful  to  measure  the  execution  time  by  means  of  cpu  and  user  time  required  by  the 
simulation  tasks  rather  than  the  staying  time  of  the  simulation  tasks.  The  speedup  defined 
by  cpu  and  user  time  increases  nearly  linear,  whereas  the  performance  measured  by  staying 
time  of  the  simulation  processes  saturates  due  to  communication  bottlenecks  and  idle  times. 
Fig.  2  shows  the  performance  within  a  workstation  cluster  measured  by  accumulation  of 
cpu-  and  user-time  as  well  as  by  the  staying  time  of  the  simulation  tasks.  The  overall 
performance  is  dependent  to  the  utilization  of  all  available  computing  nodes.  On  networks 
with  a  bus  topology,  all  computing  nodes  have  to  share  the  available  communication  band¬ 
width.  The  connection-less  access  mode  in  commonly  used  Ethernet  based  networks,  which 
accepts  even  collisions  of  data  packets,  leads  to  a  further  descent  of  the  throughput  and 
limits  also  the  number  of  computing  nodes  working  on  the  same  network  segment  within 
the  computing  environment.  Therefor  we  admit  only  a  sequential  communication  scheme, 
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allowing  only  one  pair  of  clients  to  communicate  with  each  other  concurrently  during  the 
communication  phase  between  the  TLM  timesteps.  This  guarantees  that  only  one  client 
submits  messages  at  one  time  and  thus  additional  collisions  are  avoided.  Enhanced  net¬ 
working  technologies  like  switched  ethernet  or  FDDI  networks  will  also  reduce  the  commu¬ 
nication  bottlenecks.  Using  specialized  hardware,  such  as  IBM’s  SP2  computer,  consisting 
of  high-speed-interconnected  processing  nodes,  these  problems  became  unprimed,  and  the 
communication  phase  can  be  shortened  by  concurrent  communication  between  pairs  of 
clients.  PVM  supports  the  message-passing  paradigm  and  preserves  compatibility  to  the 
now  up-coming  MPI-(Message-Passing-Standard)  for  portable  distributed  computing. 

An  important  feature  is  the  re-usability  of  existing  software  components.  Since  our  ap¬ 
proach  uses  existing  sequential  code,  which  was  extended  for  communication  and  controlling 
tasks,  re-use  is  introduced  at  a  high-level  and  the  effort  for  the  raaintainance  of  the  software 
package  is  kept  small.  The  decision  starting  a  simulation  task  standalone  or  distributed 
depends  on  the  mesh  size  and  the  actual  available  computing  resources.  The  parallelization 
model  is  given  in  Fig.  1.  We  use  a  hybrid  model  for  the  embedding  of  the  existing  computer 
codes,  leaving  most  of  communication  between  the  working  clients  and  introduce  only  a 
small  overhead  for  controlling  tasks  managed  by  a  master  program. 

PVM  yields  a  very  reliable  communication  interface  also  in  heterogeneous  environments, 
hiding  all  details  of  a  specific  platform.  Also  the  embedding  of  machines  with  very  dif¬ 
ferent  architecture  like  MIMD-and/or  SIMD-computers  is  straightforward  whereby  only 
minor  changes  in  the  computer  code  are  necessary.  The  approach  yields  a  transparent  sim¬ 
ulation  tool  hiding  all  details  of  the  specific  computing  node.  Automatic  space  partitioning 
is  performed  by  a  off-line  preprocessing,  according  to  minimize  the  number  of  variables, 
which  have  to  be  exchanged  and  due  to  the  available  computational  power  of  the  specific 
computing  nodes  and  network  links. 

Simulation  of  planar  circuits 

Planar  circuits  are  designed  for  the  application  in  microwave  circuits  of  high  package  den¬ 
sity  and  are  optimized  for  minimum  square  dimensions  and  minimum  aspect  ratio.  For  such 
problems  the  three-dimensional  TLM  method  is  inherently  suitable,  especially  if  also  multi¬ 
layer  structures  or/and  three-dimensional  discontinuities  such  as  via-holes,  air-bridges  are 
included  into  the  planar  circuit.  The  investigation  of  structures  with  metallic  boundaries 
requires  also  a  correct  modeling  of  the  skin  effect  surface  impedance  of  thin  metallic  bound¬ 
aries,  taking  also  into  account  its  dependence.  In  such  cases  a  conventional  TLM  scheme 
is  not  applicable  for  the  discretization.  We  have  introduced  a  surface  impedance  model, 
which  allows  the  consideration  of  thin  conducting  sheets  enclosed  in  a  TLM  mesh  [10]. 
Therein  the  skin  effect  surface  impedance  is  modeled  with  arbitrary  accuracy  by  lumped 
element  ladder  networks  consisting  of  cascaded  L-R  -  sections.  For  the  extraction  of  S- 
Parameters  high-quality  absorbing  boundary  conditions  are  necessary  especially  in  the  port 
regions.  One-way  absorbing  boundary  conditions  lack  from  their  stability,  often  limiting 
the  length  of  the  observable  time  response,  whereas  global  boundary  conditions  like  the 
radiating  boundery  condition  based  on  discrete  green’s  function  given  in  [11],  requires  a  lot 
of  nonlocal  communication  between  several  nodes.  This  will  decrease  the  computational 
performance  considerablely  and  will  not  be  advantagous  for  distributed  computing.  The 
use  of  absorbing  materials  for  modeling  accurate  boundery  conditions  found  growing  inter¬ 
est  in  the  finite-difference  community  and  may  by-pass  this  problems.  Berenger  [12]  found 
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that  non-physical  materials  comply  with  the  relation 


CTe  € 

yields  absorbing  material  for  incident  waves  at  arbitrary  angles.  Utilizing  several  layers  with 
quadratically  increasing  conductivity  Ue  and  magnetic  conductivity  Om-  a  perfect  matching 
layer  (PML)  is  constructed.  The  PML  technique  can  be  implemented  easily  into  the  TLM 
method  preserving  the  locality  of  the  TLM  algorithm. 

Simulation  examples 

We  demonstrate  the  application  of  distributed  TLM  simulator  for  the  modeling  the  elec¬ 
tromagnetic  field  in  complex  planar  circuits.  In  the  current  work  we  investigate  CPW 
elements  for  MMICs.  Planar  circuits  are  designed  for  the  application  in  microwave  circuits 
of  high  package  density  and  are  optimized  for  minimum  square  dimensions  and  minimum 
aspect  ratio.  For  such  problems  the  three-dimensional  TLM  method  is  inherently  suitable, 
especially  if  also  multi-layer  structures  or/and  three-dimensional  discontinuities  such  as 
via-holes,  air-bridges  are  included  into  the  planar  circuit. 

We  have  investigated  several  coplanar  structures  including  discontinuities,  meander  lines 
and  spirals  as  well  as  self-similar  structures  such  as  the  3/2-line  and  Hilbert  curves,  in 
which  the  structures  are  distributed  evenly  over  the  dedicated  area.  The  Hilbert  curves 
represent  optimum  space-filling  curves  with  appropriate  aspect  ratios.  Fig.  3  shows  the 
schematic  of  a  modified  hilbert  curve  and  its  time  response.  In  Fig.  4  the  field  distribution 
is  shown  at  different  instants  of  time. 

In  Fig.  5  the  schematic  of  a  coplanar  35-to-50r2-transition  is  given,  which  has  been  in¬ 
vestigated  previously  by  [13].  The  geometrical  parameters  at  the  350  line  are  given  by 
the  width  of  the  center  conductor  w  =  20/im  and  the  gap  width  s  =  5/im,  the  parameters 
at  the  50  0  line  are  given  by  w  =  15/.ira  and  the  gap  width  s  =  10/im.  The  simulation 
was  performed  for  an  GaAs  substrate  with  an  dielectric  constant  e  =  12.9.  The  input 
reflection  coefficient  is  shown  by  Fig.  6  in  magnitude  and  phase.  Currently  we  are  working 
on  simulation  of  interconneting  structures  and  multilayer  structures. 

Conclusion 

We  have  presented  an  extremely  cost-effective  approach  for  the  time-domain  simulation  of 
coplanar  circuit  structures  wuth  the  TLM  method  utilizing  a  parallel  software  environment. 
The  use  of  matched  layers,  skin  effect  model  and  performance  aspects  are  discussed.  The 
feasibility  of  the  approach  is  demonstrated  by  the  analysis  of  several  complex  coplanar 
structures. 
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Figure  1;  Parallel  Programming  Model  using  the  Parallel  Virtual  Machine  (PVM) 
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Problem  size  320x32x12 
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Figure  2;  Performance  of  distributed  TLM  simulation 
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Figure  3:  Time  Response  of  the  Modified  Hilbert  Curve 
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Abstract 

In  this  paper  the  symmetrical  condensed  node  (SCN)  TLM  mode!  of  gyromagnetic  materi¬ 
als  is  described.  The  constitutive  relations  of  the  medium  are  represented  by  equivalent  sources 
connected  to  the  node.  The  resulting  TLM  scattering  matrix  is  independent  of  the  material 
behavior.  The  equivalent  sources  are  calculated  at  each  timestep  and  included  in  the  scattering 
procedure  of  TLM.  This  approach  is  robust,  general,  and  capable  of  modeling  the  frequency 
dependent  characteristics  of  the  gyromagnetic  medium  in  time  domain.  The  technique  has  been 
implemented  and  used  in  the  simulation  of  a  resonance  ferrite  Isolator. 

Introduction 

The  TLM  method  yields  the  numerical  solution  of  Maxwell’s  equations  using  the  discretized 
Huygens  principle  [1].  The  technique  is  efficient,  general,  and  useful  for  solving  complex  electromag¬ 
netic  problems.  In  TLM  the  medium  constants  are  usually  represented  by  open  and  short  circuited  stubs 
connected  to  the  node,  [2],  This  works  well  for  frequency  independent  permittivity  and  pemeability; 
however,  in  more  complex  relationships,  other  approaches  must  be  used. 

A  major  research  effort  in  TLM  is  devoted  to  modeling  of  electromagnetic  constitutive  relation¬ 
ships  ([3]-[5]),  which  describe  the  interaction  of  electric  and  magnetic  fields  with  the  medium.  This 
work  shows  a  simple  procedure  to  represent  the  behavior  of  gyromagnetic  media  in  the  TLM  algorithm. 

The  medium  behavior  is  represented  by  equivalent  sources  connected  to  the  node.  The  sources 
are  determined  by  the  constitutive  equations  and  the  total  voltages  and  currents  at  the  node.  The  differ¬ 
ential  equations  describing  these  relationships  are  solved  using  state-variable  techniques  with  suitable 
discretization  schemes  [6].  The  resulting  system  of  difference  equations  is  solved  at  each  TLM 
timestep. 

This  technique  has  several  advantages  over  conventional  dielectric  and  magnetic  models  used  in 
TLM.  While  the  use  of  the  stub-loaded  approach  is  more  efficient  for  frequency  independent  media,  the 
proposed  technique  can  model  several  kinds  of  constitutive  equations  with  few  modifications  to  the 
TLM  algorithm.  The  technique  is  robust,  can  be  used  with  several  discretization  schemes,  results  in  a 
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scattering  matrix  independent  of  the  medium  behavior,  [5],  and  can  he  integrated  with  conventional  cir¬ 
cuit  analysis  programs,  if  the  equivalent  circuit  of  the  medium  behavior  is  available. 


Theory 

The  general  constitutive  relationships  are  modeled  in  the  symmetrical  condensed  node  (SCN) 
TLM  by  decoupling  the  scattering  matrix  from  the  dielectric  and  magnetic  behavior  of  the  medium. 
The  medium  is  represented  by  sources  connected  to  the  node. 

Equivalent  source  approach 

The  SCN  formulation  is  obtained  by  combining  2D  shunt  and  series  nodes,  [7],  decoupled  from 
the  medium.  The  resulting  TLM  procedure  is: 
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where  the  [v]  are  vectors  of  the  branch  voltages,  [vj  is  the  source  vector,  and  the  index  t  indi¬ 
cates  the  transpose  vector.  The  propagation  part  of  the  TLM  method  remains  unchanged.  The  calcula¬ 
tion  of  the  source  incident  voltages  [v^]*  in  terms  of  the  branch  incident  voltages,  via  the  constitutive 
relationship  of  the  medium  and  the  source  reflected  voltages  [vj^  completes  the  algorithm. 

Since  the  node  sources  represent  the  magnetizability  and  polarizability  of  the  material,  the 
source  incident  voltages  are  calculated  by: 
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The  set  of  equations  (l)-(3)  allows  the  modeling  of  arbitrary  constitutive  relationships  in  the 
SCN.  The  medium  equations  (2)-(3)  are  solved  at  each  timestep  by  solving  the  discretized  constitutive 
relationship  differential  equations. 

Equations  (l)-(3)  can  be  solved  using  the  Thevenin/Norton  circuit  equivalents  of  the  node  in  the 
calculation.  The  node  is  replaced  by  equivalent  Norton  circuits  (used  in  the  calculation  of  the  electric 
field),  and  Thevenin  circuits  (used  in  the  calculation  of  the  magnetic  fields).  Fig.  1. 


Fig.  1  Norton  equivalent  circuit  of  the  node  for  the  electric  field  in  the  x  direction. 

The  voltage  and  current  at  the  input  port  of  each  equivalent  circuit  can  be  expressed  as: 
a)  Norton  network  (represents  the  electric  field): 


(4) 


i  (t)  =  2Y/^  (t)  -  y,v  (0  (0  -  V  (0  -  (0 

b)  Thevenin  network  (represents  the  magnetic  field): 


V  (0  =  2z/^ «)  -  z^i  (t)  v‘„(t)  =-  (4i  (0  -  v;;  (0 ) 

These  equations  can  be  rearranged  and  substituted  into  (2),  resulting  in: 
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(')  =  +  V  (0 
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The  equivalent  current  sources  in  the  three  coordinate  directions  (Norton)  are  obtained  from  the 
polarization  currents  i  and  the  total  voltages  -  representing  the  electric  field  components  -  across  the 
node,  similarly  the  eqmvalent  voltage  sources  (Thevenin)  are  obtained  from  the  magnetization  voltages 
^mag  currents  -  representing  the  magnetic  field  components  -  across  the  node: 

(!)  =  2A!^^^  (!)  +  4v^  (!)  (!)  =  2At£m^  (!)  +  4/^  (!) 

8v^j,  (!)  =  2A!^^j  (!)  +  4v^  (!)  Sv;;^,  (!)  =  2A!|my  (!)  +  4!^  (!)  p, 

^'’az  (''>  = 

Since  the  total  voltages  and  currents  across  the  node  are  obtained  in  the  solution  of  (7),  an  elfi- 
cient  way  to  execute  the  TLM  algorithm  is  to  calculate  the  sources  [vj^  at  each  timestep  using,  and  then 
solve  (7)  using  [vj"  for  the  voltages  (v^^,  Vy,  v^)  and  currents  (i^,  iy,  ij.  The  reflected  branch  voltages  can 
be  obtained  by  [7]: 


Vj  =  (0  -  (0 

v^  =  v^(0  +Zr^y(0  -^9 
V3  =  v^(0  -v'jj 

V4  =  ^(0"Z,'^(0-v' 


5  =  v^(0+Z,^;(0-v‘  (0 
V6  =  v^(0  -Z,iy(t)  -v\q 
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v5  =  v^(t)-z^yo-vi 

.  (8) 

^n=v^(0-Z,^,(0-V3 
J2  =  ^.(0+Z^i^(t)-v\ 


This  procedure  is  applicable  to  all  kinds  of  constitutive  relationships.  It  is  adapted  to  usual  TLM 
programs  [2]  by  setting  Y^=Yy=Yz=Z^=Zy=Z^=4.  The  reflected  voltages  for  the  stubs  are  obtained, 
then  (1)  and  (7)  are  used  to  obtain  the  incident  voltages  from  the  stubs,  and  then  the  reflected  voltages 
in  branches  1-12  are  calculated  with  the  scattering  matrix  (1). 
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Solving  of  the  constitutive  equations  in  the  equivalent  source  approach 

The  equivalent  node  sources  allow  the  solution  of  the  network  using  nodal  or  Tableau  analysis, 
[8].  Therefore,  a  SPICE  circuit  simulator  could  be  used  to  solve  the  network  at  all  nodes  at  each 
timestep.  The  problem  of  this  approach  is  the  need  for  an  equivalent  circuit  of  the  medium.  While  this 
may  be  easy  for  most  linear  dispersive  isotropic  materials,  it  is  more  difficult  for  an  arbitrary  constitu¬ 
tive  relationship. 

However,  the  use  of  state-variable  formulation  of  the  constitutive  relationship  equations  avoids 
this  problem.  In  this  approach,  equivalent  circuits  for  the  constitutive  relationships  are  not  necessary, 
since  the  state-equations  can  be  obtained  directly  from  nonlinear  differential  equations.  The  state-vari¬ 
able  technique  is  easily  incorporated  into  the  TLM  simulator  without  loss  of  generality.  A  general  pro¬ 
cedure  for  linear  differential  equations  shown  in  [6]  is  outlined  for  a  medium  described  by  a  fourth 
order  linear  differential  equation: 
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where:  X|(t)=f(t) 

The  resulting  equations  can  be  incorporated  into  TLM  either  by  discretizing  the  solution  or  the 
differential  operator.  The  discrete  solution  of  (10)  would  result  in  a  convolution  procedure,  restricting 
the  use  of  the  formulation  to  linear  materials.  If  the  discretized  operator  approach  is  used,  several  dis¬ 
cretization  schemes  may  be  chosen.  However,  there  are  two  major  schemes  that  are  very  attractive,  for 
reasons  of  efficiency: 

-  Backwards  Euler,  [6] 

-  Trapezoidal  (or  bilinear  mapping),  [6] 

The  first  scheme  introduces  losses  in  the  final  result,  it  is  first-order  accurate,  but  it  is  very  sim¬ 
ple  to  implement.  The  second  scheme  is  accurate  to  the  second  order,  conserves  energy  and  uses  the 
same  kind  of  discretization  as  in  TLM.  Therefore,  for  linear  isotropic  non-dispersive  materials,  the 
results  given  by  this  approach  and  usual  TLM  are  virtually  indistinguishable. 

However,  the  use  of  trapezoidal  schemes  for  the  solution  of  gyromagnetic  materials  can  cause 
instabilities  in  the  result  due  to  the  nature  of  the  constitutive  equation. 

The  discretization  schemes  transform  the  continuous  state  equations: 


=  m  (Ar(r)]  +  [B]  [«(()] 
^(Ol  =  [C]  [I  (01  +  [C]  [«(!)] 
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into  the  discretized  form: 


[;c(t+A0]  =  [P]  [^(0]  +  [Q]  [«(0]  +  [P]  U(?-A0] 

[^(t+AOI  =  [C]  [JC(^  +  A^)]  +  [D]  [M(t+A0] 

The  matrices  [P],  [Q]  and  [R]  will  depend  on  the  discretization  scheme.  In  the  backwards  Euler 
approach: 

[P]  =  ([t/] -A/[A])-‘  ([G]  =  ([[;] -A<[Al)-'Ar[B]  [fl)  =  [0])  (13) 

where  [U]  is  the  nxn  identity  matrix,  and  [0]  is  the  null  matrix. 

Representation  of  the  constitutive  relationships  in  the  state- variable  form 

This  section  describes  the  implementation  of  the  state-variable  equations  for  gyromagnetic 
media.  Considering  a  gyromagnetic  material  subject  to  a  magnetization  field  Hq  in  the  z-direction  with 
magnetization  Mq  in  the  same  direction,  the  linearized  constitutive  equations  for  the  permeability  are: 


dmx 

dt 


where  {Dq  co^  are 


dm 

Jt  - 


(14) 


obtained  from  the  bias  magnetic  field  and  the  saturation  magnetization: 
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T  =  2nx2.8xlO^(«7/Oe) 

The  constitutive  equations  (14)  can  be  expressed  in  the  state  variable  form: 
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where  the  total  magnetic  fields  in  the  x-  and  y-directions  are  calculated  using  (3)  and  (7). 


Results 

The  technique  presented  in  this  work  has  been  used  to  simulate  an  E-plane  resonance  isolator  in 
a  WR90  waveguide,  [9].  The  device  was  designed  to  have  a  30  dB  reverse  attenuation  at  10  GHz.  The 
ferrite  slab  was  0.5  mm  wide,  24  mm  long,  and  placed  at  2.54  mm  from  one  of  the  inner  waveguide 
walls,  with  a  saturation  magnetization  of  1700  Gauss  and  a  bias  magnetic  field  of  2840  Oersteds,  (Fig. 
2).  The  scheme  used  to  solve  (16)  was  the  backwards  Euler  scheme,  because  of  stability  problems 
encountered  with  the  trapezoidal  discretization  approach. 
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Since  the  backwards  Euler  scheme  introduces  numerical  losses,  it  avoided  the  instability  prob¬ 
lems  obtained  using  energy  conservative  schemes.  The  results,  presented  in  Fig.  3,  are  the  transmission 
coefficients  for  the  forward  and  backward  propagation  (S21  and  SI 2). 


Fig.  2  Geometry  of  the  resonance  ferrite  isolator. 
Forward  and  reverse  attenuation 


Fig.  3  Forward  and  reverse  transmission  coefficients  of  the  resonance  isolator. 

The  results  shown  in  Fig.  3  show  less  reverse  attenuation  than  expected.  This  was  caused  by  the 
sensitivity  of  the  results  to  the  placement  of  boundaries,  and  the  difficulty  to  simultaneously  position 
the  slab  and  to  discretize  its  thickness.  However,  there  was  very  good  agreement  of  the  forward  attenu¬ 
ation  (-  2.0  dB  at  10  GHz),  and  of  the  bandwidth  of  the  isolator  (4%). 
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Conclusions 

This  work  presented  the  symmetrical  condensed  node  (SCN)  TLM  model  of  gyromagnetic 
materials.  The  constitutive  relationships  were  represented  by  equivalent  sources  connected  to  the  node. 
The  resulting  TLM  scattering  matrix  was  independent  of  the  material  behavior.  The  equivalent  source 
was  calculated  at  each  timestep  using  state- variable  techniques  and  included  in  the  scattering  procedure 
of  TLM.  The  approach  was  presented  for  general  medium  equations,  and  used  in  the  calculation  of  the 
forward  and  reverse  attenuation  of  a  resonance  ferrite  isolator.  The  main  difficulties  encountered  in  the 
simulation  of  this  structure  were  stability  and  sensitivity  problems.  The  resulting  state-variable  equa¬ 
tions  showed  instability  problems  for  lossless  discretization  schemes,  which  were  solved  by  using 
backwards  Euler  discretization  procedure.  The  energy  loss  associated  with  the  scheme  caused  an 
increase  in  the  forward  and  reverse  attenuation  of  the  isolator.  The  forward  and  reverse  attenuations  of 
the  isolator  were  highly  sensitivity  to  the  positioning  and  width  of  the  ferrite  slab.  Since  its  was  not  pos¬ 
sible  to  model  accurately  both  position  and  width  of  the  slab,  the  computed  s  reverse  attenuation  was 
less  than  the  design  specification.  The  results  can  be  improved  by  using  special  TLM  cells  with  arbi¬ 
trary  boundary  positioning.  This  technique  is  now  being  extended  to  gyromagnetic  materials  with 
losses  and  to  other  kinds  of  TLM  nodes. 
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Abstract 

In  this  paper,  a  comparative  performance  study  of  Higdon’s  absorbing  boundary  conditions 
(ABCs)  in  TLM  and  FDTD  under  identical  conditions  (such  as  same  time  step,  excitation,  etc.)  is 
described.  It  has  been  observed  that  an  ABC  applied  directly  to  the  TLM  voltage  impulses  absorbs 
better  than  the  same  ABC  applied  to  the  TLM  total  node  voltages  or  FDTD  field  values.  It  has  also 
been  found  that  in  a  FDTD  analysis,  the  damping  factors  have  a  greater  influence  on  the  stability 
than  the  way  in  which  the  absorbing  boundary  operators  are  discretized,  while  in  a  TLM  analysis 
the  opposite  is  true.  Also,  in  a  TLM  analysis,  the  absorbing  boundaries  can  be  placed  closer  to  a 
scatterer  than  in  a  FDTD  analysis  without  affecting  the  accuracy.  These  observations  have  been 
demonstrated  by  applying  them  to  examples  such  as  waveguides  and  high  permittivity  dielectric 
cubes. 

Introduction 

The  Transmission  Line  Matrix  (TLM)  [1]  and  Finite-Difference  Time-Domain  (FDTD)  [2]  meth¬ 
ods  are  suitable  for  time-domain  simulation  of  complex  guiding  and  radiating  structures.  The 
TLM  method  is  a  physical  discretization  model  whereas  FDTD  is  a  mathematical  discretization 
model.  The  TLM  method  has  advantages  in  boundary  description,  while  the  FDTD  method 
requires  less  computer  memory.  Recently,  we  have  developed  real-time  interfaces  between  TLM 
and  FDTD  modules  [3-4],  and  Marcysiak  and  Gwarek  [5]  have  given  the  equivalent  excitation 
schemes  that  produce  identical  field  values  at  all  time  steps  in  the  expanded-TLM  and  FDTD 
meshes. 

Good  quality  ABCs  are  very  important  for  time-domain  simulations  of  electromagnetic  structures. 
Even  though  a  number  of  ABCs  have  been  reported  in  the  literature,  Higdon’s  ABCs  [6-8]  have 
been  found  to  perform  better  than  others  [9].  The  advantages  of  Higdon’s  ABCs  are  that  prior 
information  about  the  incident  angles  can  be  used  to  optimize  the  absorption  in  the  required  fre¬ 
quency  bandwidth,  and  long  term  stability  can  be  achieved  by  using  proper  discretization  of  the 
boundary  operators  (derivatives)  and/or  by  providing  damping  factors.  Very  low  reflections  can  be 
obtained  by  concatenating  two  or  more  first-order  boundary  operators. 

In  TLM,  the  absorbing  boundary  conditions  can  be  applied  directly  to  the  TLM  voltage  impulses 
reflected  by  the  ABC  or  to  the  total  node  voltages,  while  in  FDTD  they  are  normally  applied  to  the 
electric  fields.  We  have  studied  the  absorption  properties  of  Higdon’s  absorbing  boundaries  in 
TLM  and  FDTD  by  applying  them  to  a  rectangular  waveguide,  while  the  stability  aspect  has  been 
studied  by  computing  the  RCS  of  a  high  permittivity  dielectric  cube.  As  this  cube  rings  for  a  very 
long  time,  it  is  a  very  good  test  case  to  check  the  stability  aspect  of  the  absorbing  boundaries. 
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Absorbing  Boundary  Conditions 

To  obtain  good  absorption  over  a  wide  range  of  incident  angles,  we  have  concatenated  two  first- 
order  Higdon’s  boundary  operators  to  obtain  a  second-order  absorbing  boundary  condition  [8].  In 
FDTD,  a  field  value  at  the  absorbing  boundary  can  be  computed  from  the  knowledge  of  field  val¬ 
ues  at  the  cells  in  front  of  the  boundary  using  the  following  equation: 

=  ia^  + a^)  {m,j,k)- a^a^EP~'\mJ,k)  +  (P  j  +  p^)  *) 

-PjP^f^t/n-  2,Y.fc)-(pjY2+7ip2>'^~^('""2’J’^>"  YiY2£” -  2.7.  *)  ^ 
The  interpolation  coefficients  are: 

{a-g-(\-b))  (fl-l+gj^)  {-o.-bg-) 

=  (a-  \-g.i\-b)  -£,A/)’  ^  (a-  \-g.{\-b)-tAl)’  ^  (a- \  -  g.(]  -  b)  -  t  .M) 

(2) 

where  coefficients  aandb  are  weighted  time  and  space  averages  of  the  space  and  time  differences, 
respectively.  £]  and  £2  are  damping  factors.  The  parameter  g,-  is 


where  A1  and  At  are  the  space  resolution  and  time  step,  respectively,  fij  are  the  incidence  angles 
which  can  be  chosen  to  optimize  the  absorption  bandwidth.  The  values  of  coefficients  a  and  b  can 
be  chosen  to  control  the  stability  of  the  absorbing  boundaries.  According  to  Higdon  [6],  a  must  be 
less  than  or  equal  to  0.5  (for  a=b)  to  get  stable  absorbing  boundary  conditions. 

In  TLM,  the  above  ABC  can  be  applied  in  the  following  two  ways: 

a)  Application  of  ABC  to  the  total  node  voltages,  V^: 

The  interpolation  equation  (1)  is  applied  to  the  total  voltages  at  the  nodes  on  the  absorbing  bound¬ 
ary.  Then  the  impulses  incident  at  the  nodes  from  the  absorbing  boundary  can  be  obtained  by  sub¬ 
tracting  the  other  three  voltage  impulses  from  the  total  node  voltages  (see  Fig.  1): 

.  V.  ~  2V"aJR)  -  tv,  (/./R)  +  Vjd'.yR)  +  VjCiJ;!)]  2  (4) 

me  4  z 

b)  Application  of  ABC  directly  to  the  TLM  voltage  impulses: 

The  interpolation  equation  (1)  is  applied  directly  to  the  TLM  voltage  impulses  to  obtain  the  volt¬ 
age  impulses  incident  at  the  absorbing  boundary  nodes.  For  example,  in  Fig.  2,  the  absorbing 
boundary  conditions  are  applied  directly  to  the  incident  voltage  impulses  incV4.  Since,  these  inci¬ 
dent  voltage  impulses  contain  both  and  Hy,  it  is  equivalent  to  applying  ABC  to  both  electric 
and  magnetic  fields  simultaneously. 

Numerical  Results 

First,  we  have  applied  the  above  ABCs  at  the  two  ends  of  a  WR  28  rectangular  waveguide  section. 
We  have  employed  a  2D  shunt  TLM  mesh  and  a  2D  FDTD  mesh.  The  mesh  size  was  (100x42). 
The  space  resolution  and  time  step  were  0. 169  mm  and  0.399  ps,  respectively.  The  domin^t  TEjq 
mode  in  the  waveguide  was  excited  by  injecting  the  total  voltages  ( E^  values  in  FDTD)  in  a  line 
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whose  magnitudes  distributed  according  to  the  sine  function.  Cosine  modulated  Gaussian  tempo¬ 
ral  variation  was  used  to  make  sure  that  only  the  frequencies  of  interest  were  excited.  The  electric 
field  values  were  sampled  along  the  propagation  direction  and  Fourier  transformed  to  obtain  the 
minimum  and  maximum  field  values  at  each  frequency.  The  magnitude  of  reflections  obtained  as 
(VSWR-1)/(VSWR+1)  are  plotted  in  Figs.  3a  and  3b  for  two  sets  of  incident  angles.  We  can  see 
that  the  plot  of  the  reflections  of  the  ABC  applied  to  the  TLM  total  node  voltages  is  overlapping 
with  that  of  the  reflections  of  the  ABC  applied  to  FDTD.  But,  the  reflections  obtained  for  the  ABC 
applied  to  the  incident  TLM  impulse  voltages  are  lower  by  about  20  dB.  It  should  be  noted  here 
that  the  superior  performance  of  the  absorbing  boundaries  in  the  TLM  environment  has  also  been 
independently  observed  by  Giannopoulos  and  Tealby  [10],  They  have  observed  that  even  the  first- 
order  ABC  applied  to  the  TLM  voltage  impulses  performs  better  than  the  second-order  ABC 
applied  to  the  FDTD/TLM  (total  node  voltage). 

As  mentioned  before,  we  have  considered  a  high  permittivity  dielectric  cube  to  study  the  stability 
aspect.  The  size  of  the  cube  was  7.72  mm  and  its  permittivity  was  79.46.  This  cube  rings  for  a 
very  long  time  [11-12].  We  have  employed  a  3D  symmetrical  condensed  node  TLM  [13]  mesh 
and  a  3D  FDTD  mesh  for  the  analysis  of  the  cube.  It  was  discretized  with  15  cells  on  each  side. 
The  space  resolution  and  time  step  were  0.5147  mm  and  0.8578  ps,  respectively.  A  plane  wave 
with  a  electric  field  component  Ey  and  magnetic  field  component  was  incident  on  one  face  of 
the  cube  normally  (shown  in  Fig.  4).  This  was  achieved  in  the  symmetrical  condensed  node  by 
launching  the  impulses  on  branch  3.  We  have  used  a  cosine  modulated  Gaussian  pulse  as  the  exci¬ 
tation  to  make  sure  that  only  the  frequencies  of  interest  were  excited.  This  also  ensured  that  the 
D.C.  and  very  low  frequencies  were  not  excited,  otherwise,  the  absorbing  boundaries  could  not  be 
placed  close  to  the  cube  surface.  The  excitation  Gaussian  pulse  width  (corresponding  to  -34  dB) 
was  148  At.  The  values  of  the  incidence  angles  used  in  the  design  of  the  absorbing  boundaries 
were  30°  and  80°. 

The  absorbing  boundaries  were  placed  20  cells  away  from  the  cube  surfaces.  This  distance  is 
often  referred  to  as  white  space  [11].  The  mesh  size  was  (55x55x55).  In  case  of  the  TLM  analysis, 
we  observed  growing  oscillations  beyond  2000  time  steps  for  positive  values  of  coefficients  a 
=b=0.25.  However,  for  negative  values  of  the  coefficients  a=b=-0.25  there  is  no  stability  problem 
(see  Figs.  5a  and  5b).  Stability  beyond  60,000  time  steps  has  been  obtained  with  a  white  space  of 
just  20  Al.  The  RCS  computed  with  60,000  TLM  time  steps  is  plotted  in  Fig.  8  for  a  bistatic  angle 
of  8  degrees.  The  computation  compares  well  with  the  measurements. 

Next,  we  have  used  the  FDTD  method  to  analyze  the  same  structure.  The  same  space  resolution, 
time  step,  white  space  (20  Al)  and  absorbing  boundary  parameters  were  used.  We  observed  grow¬ 
ing  time  responses  for  both  positive  and  negative  values  of  the  coefficients  a  and  b  (see  Figs.  6a 
and  6b).  Growing  oscillations  beyond  32,768  time  steps  have  also  been  reported  in  the  FDTD 
analysis  of  the  cube  using  Mur’s  absorbing  boundaries  in  spite  of  a  larger  white  space  of  40  Al 
[11]).  However,  introduction  of  the  damping  factors  controlled  the  growth,  but  the  time  response 
(see  Figs.  7a)  has  a  slight  oscillatory  behaviour  around  D.C.  The  RCS  plot  computed  with  this 
time  response  is  not  smooth  (see  Fig.  9),  and  the  results  are  machine  dependent.  This  shows  how 
the  computer  round  off  errors  can  affect  a  time-domain  analysis  when  the  absorbing  boundaries 
are  placed  close  to  a  scatterer.  We  tried  with  different  values  of  the  damping  factors,  but  the  results 
were  only  slightly  better.  This  may  be  due  to  the  interaction  of  near  fields  with  the  absorbing 
boundaries.  To  ascertain  this,  we  increased  the  white  space  from  20  Al  to  30  Al.  This  increases  the 
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computational  space  from  166,375  cells  to  274,625  cells,  i.e.  by  a  factor  of  1.65  times.  The  FDTD 
time  response  is  plotted  in  Fig.  7b  It  does  not  have  any  oscillatory  behavior  around  D.C.  RCS 
data  computed  with  this  time  response  are  plotted  in  Fig.  8.  Both  FDTD  and  TLM  results  com¬ 
pare  well  with  measurements  at  lower  frequencies.  At  higher  frequencies,  the  FDTD  and  TLM 
results  are  shifted  towards  the  lower  end  with  respect  to  the  measurement.  However,  this  shift  is 
larger  for  the  FDTD  results  than  the  TLM  results. 

Conclusions 

The  performance  of  Higdon’s  ABCs  in  TLM  and  FDTD  under  identical  conditions  has  been  stud¬ 
ied.  ABCs  applied  directly  to  the  TLM  voltage  impulses  absorb  better  than  those  applied  to  the 
TLM  total  node  voltages  or  FDTD  fields.  Our  study  shows  that  in  a  FDTD  analysis  the  damping 
factors  have  a  greater  influence  on  the  stability,  while  in  a  TLM  analysis  the  discretization  coeffi¬ 
cients  such  as  the  weighted  space  average  of  the  time  difference  and  the  weighted  time  average  of 
the  space  difference  have  a  greater  effeet  than  the  damping  factors.  Also,  in  a  TLM  analysis,  the 
absorbing  boundaries  can  be  placed  closer  to  the  scatterer  than  in  a  FDTD  analysis.  These  features 
of  TLM  outweigh  the  computational  efficiency  of  the  FDTD  method  in  the  examples  presented 
above. 
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Fig.  1  a:  2-D  Shunt  TLM  Node  Fig.  lb:  2-D  FDTD  Node 

_  i 


Fig.  4:  The  high  permittivity  cube  with  incident  electric  field  Ey  (cube  side  length  = 
7.72  mm,  =  79.46,  bistatic  angle  =  8  degrees) 
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Fig.  3a:  Return  loss  of  Higdon’s  second-order 
absorbing  boundary  condition  in  a  WR28 
waveguide  (a  =  0.5,  b  =  0.5,  0]  =  30®,  02  = 
45®,  ei=0.0,£2  =  0.0) 
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Fig.  3b;  Return  loss  of  Higdon’s  second- 
order  absorbing  boundary  condition  in  a 
WR28  waveguide  (a  =  0.5,  b  =  0.5,  0^  =  5®, 
02  =  60®,  £1  =  0.0,82  =  0.0) 


Number  of  Time  Steps 


Fig.  5a:  TLM  time  response  of  the  high 
permittivity  cube  with  positive  coefficients 
a  and  b  {a=0.25,  b=0.25,  0j=3O®,  02=80®, 
£l  =  0.0,  £2  =  0.0,  white  space  =  20  Al) 


Fig.  5b:  TLM  time  response  of  the  high  per¬ 
mittivity  cube  with  negative  coefficients  a 
and  b  (a=-0.25,  b=-0.25,  0i=3O®,  02=80®, 
£1  =  0.0,  £2  =  0.0,  white  space  =  20  Al) 
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Fig.  6a:  FDTD  time  response  of  the  high  Fig.  6b:  FDTD  time  response  of  the  high 
permittivity  cube  with  positive  coeffi-  permittivity  cube  with  negative  coefficients 
cients  a  and  b  (a  =  0.5,  b  =  0.5,  Oj  =  a  and  b(a  =  -  0.25,  b  =  -  0.25,  Oj  =  30®,  02 

30®,  02  =  80®,  £]  =  0.0,  £2  =  0.0,  white  =  80®,  £]  =  0.0,  £2  =  0.0,  white  space  =  20 

space  =  20  Al)  Al) 


Number  of  Time  Steps  *  Number  of  Time  Steps 


Fig.  7a:  FDTD  time  response  of  the  high  Fig.  7b;  FDTD  time  response  of  the  high 
permittivity  cube  with  white  space  =  20  Al  permittivity  cube  with  white  space  =  30  Al 
(fl  =  0.5,  b  =  0.5,  01=  30®,  02=  80®,  (a  =  0.5,  b  =  0.5,  0i=  30®,  02=  80®, 

£1  =  0.001,  £2  =  0.001)  £1  =  0.01,  £2  =  0.0) 
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Fig.  8:  RCS  computed  with  TLM,  FDTD  and  comparison  with  measurements 


Fig.  9:  RCS  computed  with  FDTD  for  white  space  -  20  A1 
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Abstract 

The  three-dimensional  alternating  transmission  line  matrix  scheme  (ATLM)  is  ap¬ 
plied  to  the  modelling  of  coplanar  waveguide  discontinuities.  In  the  ATLM  scheme  the 
TLM  cells  are  subdivided  into  two  subsets  of  mutually  neighbouring  cells.  Within  each 
time  step  the  state  of  one  subset  of  cells  is  computed  from  the  states  of  the  neighbouring 
cells  at  the  previous  time  step.  Compared  with  existing  TLM  schemes  the  numerical 
effort  as  well  as  the  storage  requirements  are  reduced  by  up  to  50%  without  loss  of  accu¬ 
racy.  Furthermore,  spurious  solutions  occuring  in  existing  TLM  schemes  can  be  avoided 
by  ATLM. 


1  Introduction 

The  modelling  of  coplanar  \vaveguide  structures  requires  three-dimensional  electromagnetic 
full- wave  simulation  [1].  For  modelling  coplanar  discontinuities  the  TLM  method  exhibits  the 
advantage,  that  a  great  variety  of  structures  may  be  analyzed  without  a  high  effort  of  adapting 
the  method  to  the  specific  problem.  The  three-dimensional  TLM  method  with  symmetrical 
condensed  node  (SCN)  introduced  by  Johns  [2]  has  proven  to  be  a  very  powerful  method  of 
electromagnetic  field  computation  [3].  In  the  alternating  transmission  line  matrix  (ATLM) 
scheme  the  TLM  cells  are  subdivided  into  two  subsets  of  mutually  neighbouring  cells.  Within 
each  time  step  the  state  of  one  subset  of  cells  is  computed  from  the  states  of  the  neighbouring 
ceils  at  the  previous  time  step.  Compared  with  existing  TLM  schemes  the  numerical  effort  as 
well  as  the  storage  requirements  are  reduced  by  up  to  50%  without  loss  of  accuracy.  Further¬ 
more,  spurious  solutions  occuring  in  existing  TLM  schemes  can  be  avoided  by  ATLM  [4,5]. 

In  homogeneous  regions  modelled  by  TLM  without  stubs  there  is  no  mutual  dependence 
of  the  states  of  neighbouring  cells.  This  can  be  seen  clearly  if  we  define  a  so-called  parity 
p  assigned  to  TLM  wave  pulses  discrete  space  coordinates  I,  m,  n  and  the  discrete  time 
coordinate  k  by 

p  =  sign  (k  +  l  +  m  +  n)  .  (1) 

If  the  sum  A:  +  ^  +  m  +  n  is  even  then  p  =  +1  and  if  fc  +  1  +  m  +  n  is  odd  then  p  =  —1. 
Now  it  is  easy  to  check  that  in  any  TLM  mesh  without  stubs  and  boundaries  the  TLM  pulses 
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depend  only  on  previous  pulses  with  the  same  parity.  If  in  the  complete  TLM  mesh  including 
stubs  only  transmission  line  elements  of  length  Al  are  used,  the  parity  of  all  transmitted  and 
reflected  pulses  is  conserved.  Regions  with  different  permittivites  can  be  modeled  using  a  TLM 
mesh  with  constant  Al  if  stubs  are  introduced.  In  order  to  maintain  synchronism,  in  ATLM 
stubs  of  length  Al  instead  of  stubs  with  length  Al/2  have  to  be  used. 


2  Modelling  of  Dielectric  Media  in  SCN-ATLM 


In  ATLM  with  symmetrical  condensed  node  (SCN)  dielectric  and  resistive  media  are  modelled 
via  stubs.  The  permittivity  and  the  permeability  of  the  material  determine  the  characteristic 
admittance  and  impedance  of  the  stubs.  If  regions  with  different  permittivities  are  modelled 
the  region  with  the  lowest  permittivity  may  be  modelled  without  stub.  Modelling  of  losses  is 
also  possible.  Therefore  resistive  stubs  are  introduced,  which  do  not  reflect  incident  pulses. 

In  contrast  to  ordinary  SCN-TLM  with  stubs  the  stub  length  has  to  be  increased  from 
Al/2  to  Al.  If  a  stub  length  of  Al  is  chosen,  the  pulses  reflected  from  the  end  of  the  stub 
are  synchronous  with  the  pulses  propagating  in  the  mesh.  This  is  achieved  by  storing  the 
pulses  for  two  time  steps  in  the  stubs.  Therefore,  we  have  to  change  the  stub  admittances  and 
impedances  for  compensating  the  double  length,  i.  e.  one  half  of  ys  and  Zg  of  th®  Al/2  stubs 
are  assigned  to  the  corresponding  parameters  of  the  Al  stubs. 

The  three-dimensional  condensed  node  scheme  will  be  extended  in  the  following  way.  The 
scattering  matrix  S  in  the  symmetrical  notation  corresponding  to  fig.  1  is  given  by 
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The  parameters  a,  P,  7,  5  and  Xi  are  given  by 


Us  T  9s 


+  ■ 


Zs+Ts 


8  +  2r/s  +  2gs  8  +  2zs  +  2t, 


P=- 


ys  +  ps 


•^s  T  Tc 


8  +  2ys  +  2^5  8  +  2^s  +  2rs 


2 

^  ^  +  Us  +  9s 

Xl  =  \/^7  ,  X2  =  y/^s5  ,  X3 
with  the  stub  admittance  and  impedance 


4  +  Zs  +  T-s  ’ 

ys  -  ffs  -  4  _  4  -  +  Ts  ■ 

ys  +  Ps  +  4  ’  4  +  ’ 


ys  =  2{er  -  1)  ,  2:^  =  2{fir  -  1)  ■ 


The  electric  losses  and  the  magnetic  losses  are  introduced  by 


9s  —  .  Tg 


(7) 

(8) 
(9) 


(10) 


(11) 


We  use  the  mapping  introduced  by  Johns  [2,6],  which  relates  all  six  field  components  at  the 
center  of  the  node  to  the  wave  amplitudes  of  its  twelve  ports. 


3  Modelling  of  the  Walls  in  SCN-ATLM 

In  order  to  conserve  the  parity  of  the  TLM  pulses  in  ATLM  transmission  lines  of  length  Al/2 
are  not  allowed.  Walls  may  only  be  located  therefore  in  planes  spanned  by  the  centers  of 
the  TLM  cells.  This  requires  the  introduction  of  boundary  and  wall  nodes  for  plane  surfaces, 
corners,  and  edges.  The  scattering  matrices  describing  the  boundary-  and  wall  nodes  of  SCN- 
ATLM  may  be  derived  from  the  scattering  matrix  the  SCN  with  stubs  of  length  Al.  Scattering 
matrices  for  boundary  nodes  are  computed  in  the  following  way: 

1.  Elimination  of  ports: 

•  Ports  in  the  plane  of  an  electric  wall  with  polarization  parallel  to  the  wall  are 
short-circuited. 

•  Ports  in  the  plane  of  a  magnetic  wall  with  polarization  normal  to  the  wall  are 
open-circuited. 

•  Ports  behind  an  electric  wall  are  open-circuited, 

•  Ports  behind  a  magnetic  wall  are  short-circuited. 

2.  Change  of  the  characteristic  impedances: 

•  The  characteristic  impedance  of  a  line  in  the  plane  of  an  electric  wall  with  polar¬ 
ization  normal  to  the  wall  is  divided  by  2. 

•  The  characteristic  impedance  of  a  line  in  the  plane  of  a  magnetic  wall  with  polar¬ 
ization  parallel  to  the  wall  is  multiplied  by  2. 
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As  an  example  we  present  the  scattering  matrix  for  the  electric  wall  bounding  a  dielectric 
medium  with  relative  permittivity  We  obtain  the  scattering  matrix  for  the  electric  wall 
using  the  SCN  with  stubs 


r  0  0  0  0  on 

0  0  0  0  0 
0  6  6  6  ^4 

^  0  6  ei  6  6  6 

0  ^2  ^2  Cl  Cl  C4 

0  C2  C2  Cl  Cl  C4 

0  0  C4  C4  C4  C4  Cs . 


(12) 


with  the  parameters 


Ci  = 


1  ~  gf 
2£r-l'' 


C2  = 


4£r  -  2  ' 


C3- 


2g,-3  V2£,-2 

2£,-1’  2£r-l 


(13) 


Since  boundaries  and  walls  are  intersecting  the  ATLM  cells  the  scattering  matrices  of  the 
boundary  and  wall  ceils  and  have  to  be  calculated  numerically  prior  to  the  start  of  the  field 
simulation.  However  this  requires  only  a  minor  part  of  the  computational  effort. 


4  Numerical  Simulations  with  ATLM 

We  have  applied  the  ATLM  scheme  to  the  modelling  of  coplanar  lines  and  coplanar  line 
discontinuities  on  GaAs  substrate.  The  reduction  of  the  computational  effort  and  the  memory 
requirements  does  not  affect  the  accuracy  of  the  computation.  In  the  case  of  the  stubless  SCN 
a  reduction  of  factor  2  is  achieved.  When  using  SCN  with  stubs,  memory  requirement  reduces 
by  a  factor  of  2/3. 


Tab.  1:  Memory  requirements  of  ATLM  and  traditional  TLM 


SCN  with  stubs 

ATLM 

trad.  TLM 

mesh  size 

71  X  71  X  174 

71  X  71  X  174 

number  of  cells 

877134 

877134 

actual  numer  of  cells  in  memory 

438567 

877134 

data  per  cell 

TLM  amplitudes:  12 
stub  amplitudes:  6 
media  pointer:  1 

TLM  amplitudes:  12 
stub  amplitudes:  6 
media  pointer:  1 

effective  data  per  cell 

TLM  amplitudes:  5  x  12 
stub  amplitudes:  6 
media  pointer:  1 

TLM  amplitudes:  12 
stub  amplitudes:  6 
media  pointer:  1 

total  memory 

46.8  Mb 

66.9  Mb 

The  ATLM  scheme  is  used  to  determine  the  reflection  coefficient  of  the  CPW  discontinuity 
located  in  a  hollow  waveguide.  Fig.  2  shows  the  dimensions  of  the  structure’s  geometry.  For 
comparison,  reference  values  of  the  same  structure  are  calculated  by  mode  matching  [1]. 


313 


For  performing  the  ATLM-simulation  equidistant  meshing  is  applied.  In  order  to  reduce 
computational  effort  a  symmetry  plane  is  introduced,  which  cuts  the  CPW  in  longitudinal 
direction.  With  Al  =  0.05mm,  the  mesh  is  discretized  by  71  x  71  x  174  TLM  nodes  in  cubic 
cells.  An  electric  field  with  gaussian  time  dependance  is  excited  at  the  front  end  of  the  line.  As  a 
result  of  the  simulation,  we  obtain  the  time  response  Ecpyf{nAt]  at  the  observation  point,  which 
includes  both,  injected  and  reflected  signals.  Of  particular  interest  is  the  reflection  coefficient 
Su{nAf).  Therefore,  one  has  to  separate  incoming  and  outgoing  signals.  This  requires  the 
simulation  of  a  reference  structure  which  does  not  contain  the  discontinuity,  but  only  the 
CPW-line.  With  this  reference  signal  Eref(nAt),  one  obtains  for  the  reflection  coeflScient 


SninAf)^ 


^cpw  {nAf)  -ErAn^f) 
ErAn^f) 


(14) 


where  .E(nA/)  is  the  discrete  Fourier  transformation  of  E{n'At). 

In  our  simulation,  absorbing  boundary  conditions  are  necessary  at  both  ends  of  the  wave¬ 
guide.  It  is  important  to  suppress  the  artificial  reflection  signals  caused  by  the  mesh  boundaries. 
For  that  purpose  we  insert  layers  of  lossy  media,  as  shown  in  fig.  3.  The  absorbing  layers  are 
terminated  by  electric  walls.  To  prevent  the  problem  of  reflections  at  transitions  between  layers 
with  different  characteristic  impedances,  we  have  to  choose  the  properties  of  the  media  of  the 
layers  in  a  suitable  way.  We  not  only  introduce  an  electric  conductivity  a,  but  also  a  magnetic 
conductivity  am-  If  the  impedance  matching  condition 

-  =  —  ,  (1^) 
P-  <7in 

is  fulfilled,  the  characteristic  impedances  of  the  absorbing  layers  are  equal.  In  our  example, 
we  use  ten  layers  of  matched  lossy  media  with  increasing  conductivity  towards  the  outside. 
For  quasi  TEM-waves,  the  absorbing  capabilities  are  very  good.  The  magnitude  |5ii{  of  the 
reflection  coefficient  of  the  double  step  discontinuity  after  fig.  2  is  depicted  in  fig.  4.  The 
result  of  the  ATLM  simulation  is  compared  with  the  result  of  the  mode  matching  simulation 
according  to  [1]. 
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Fig.  1:  Numbering  schemes  for  the  TLM  ports. 
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Fig.  2:  CPW  with  discontinuity 
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’field  excitation 

Fig.  3;  Location  of  the  mached  absorbing  layers. 


Fig.  4:  Reflection  coefficient  calculated  with  ATLM  and  mode  matching. 


6 


Quasi-Static  Correction  of  a  Knife  Edge  Corner 
in  2D  TLM  Algorithm 

Lucia  Cascio®,  Giampaolo  Tardioli®,  Tullio  Rozzi*,  and  Wolfgang  J.R.  Hoefer° 

®NSERC/MPR  Teltech  Research  Chair  in  RF  Engineering. 

Department  of  Electrical  and  Computer  Engineering,  University  of  Victoria, 

Victoria,  B.C.  V8W3P6,  CANADA 

*Dipartimento  di  Elettronica  ed  Automatica,  Universita’  di  Ancona 
Ancona,  1-60131,  ITALY 

Abstract 

Numerical  techniques  such  as  TLM  and  FDTD  are  used  presently  to  solve  a  large  class  of  electromag¬ 
netic  problems.  However,  a  common  drawback  of  these  techniques  resides  in  the  difficulty  toaccurately 
describe  structures  in  the  presence  of  fine  features  such  as  sharp  edges,  where  a  singularity  of  the  elec¬ 
tromagnetic  fields  is  encountered. 

In  this  paper  a  local  modification  of  the  2D-TLM  algorithm  for  the  nodes  surrounding  a  knife  edge  is 
proposed.  The  Green’s  function  for  a  semi-infinite  knife  edge  is  evaluated  under  a  quasi-static  approxi¬ 
mation  and  used  to  derive  an  equivalent  circuit  of  the  corner,  characterized  as  a  three-port  device. 

The  method  is  tested  by  evaluating  the  resonant  frequencies  of  a  2D  septum-coupled  cavity.  An  accu¬ 
rate  value  for  the  resonant  frequencies  of  the  cavity  is  extracted  by  Richardson  extrapolation  of  TLM 
results  for  increasingly  fine  discretization. 

The  resonant  frequencies  obtained  with  the  comer  correction  algorithm  for  different  discretizations  are 
finally  compared  with  the  extrapolated  results. 

Introduction 

The  transmission  line  matrix  (TLM)  method  [1]  is  widely  regarded  as  an  efficient  and  flexible  tech¬ 
nique  for  the  analysis  of  a  large  class  of  electromagnetic  problems.  One  of  the  main  limitations  of  this 
and  other  numerical  techniques  is  that  the  spatial  discretization  fails  to  accurately  describe  the  singular¬ 
ities  of  the  electromagnetic  field,  which  occur,  for  example,  close  to  sharp  edges. 

Unless  a  very  fine  discretization  mesh  is  used,  the  singular  behavior  around  the  comer  is  poorly  repre¬ 
sented,  and  the  frequency  domain  characteristics  of  the  structure  will  typically  be  shifted.  This  error  is 
very  often  unacceptable  when  we  are  dealing  with  narrow-band  structures  such  as  filters. 

The  accuracy  of  the  discretized  model  can  be  improved  by  introducing  a  better  description  of  the  field 
singularity,  through  local  modification  of  the  algorithm. 

An  approach  based  on  the  local  modification  of  the  standard  TLM  method  to  account  for  the  energy 
stored  around  the  edge  has  been  proposed  in  [2].  The  nodes  surrounding  the  comer  are  loaded  with 
stubs  with  optimized  characteristics. 

In  this  paper  a  new  approach  based  on  the  quasi-static  approximation  of  the  Green’s  function  for  an  infi- 
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nite  conductive  wedge  is  proposed.  The  field  distribution  around  a  comer  is  represented  in  terms  of  an 
equivalent  three-port  circuit;  its  implementation  in  the  TLM  algorithm  is  straightforward  and  particu¬ 
larly  efficient. 


Theory 

Consider  a  cuirent  filament  adjacent  to  a  conducting  wedge,  as  shown  in  Fig.  1,  where  (p’,<t)’)  indicate 
the  source  position  and  (p,(|))  the  field  point.  The  excitation  is  an  impulsive  current  of  strength  I. 


Fig.  1  Perfectly  conducting  wedge 


In  this  two-dimensional  problem  the  electric  field  component  can  be  expressed  as  a  series  of  trigono¬ 
metric  and  Bessel  functions  given  in  [3]  as: 


£  (p,(l))  =G(p,4);p',  <{)■)/ 


G(p,(l);p',(j)’)  =- 


COjITC 


2  (tc  -  a) 

(^P')  (^p)  sin  (V  (<{)  -  a) )  sin  (v  ((j)'  -  a) )  p  <  p’ 
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^  ^  (/cp)7^(/:p')  sin  (v  ((})  -  a) )  sin  (v  (4)’ -  a) )  p>p' 
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In  the  special  case  when  the  angle  a  is  equal  to  zero  we  have  a  so-called  knife  edge.  In  this  case  the 
expression  for  the  Green’s  function  G(p,(|);p ’,([>’)  is  given  by: 


G(p,(l);p’,(t)')  ^ 
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/(2) 
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The  expression  (2)  presents  a  complex  frequency  dependance,  the  variable  k  being  a  function  of  0). 
Using  the  approximation  for  the  Bessel  and  Hankel  functions  for  small  values  of  the  argument  [4],  (2) 
can  be  considerably  simplified,  leading  to  a  quasi-static  solution.  Since 

n 

<y  (3) 

the  quasi-static  approximation  for  is  [5]: 

=G(p,(ti;p',f)/ 

n 

^  ^(^)^sin(^(t))sin(^(l)’) 

W  =  1 

n 

1  .p'  2  .  ,n..  .  ,n 

sin  (-4))  sin  (-(!)) 

n=]  ^ 

Application  to  the  TLM  mesh 

The  quasi-static  expression  for  the  electric  field  described  in  (4)  represents  the  basis  for  the  determina¬ 
tion  of  an  equivalent  circuit  describing  the  field  around  the  edge.  In  the  TLM  method  the  boundary  con¬ 
ditions  can  be  placed  either  between  two  nodes  or  on  the  nodes.  In  the  second  case  the  basic  scattering 
process  must  be  modified  for  the  boundary  node. 

In  order  to  keep  the  number  of  ports  of  the  equivalent  circuit  for  the  edge  as  small  as  possible,  the  latter 
approach  has  been  chosen  (Fig.  2).  In  this  way  only  three  TLM  nodes  are  directly  connected  to  the 
edge,  and  a  three-port  equivalent  circuit  is  sufficient  to  characterize  its  behavior. 


Fig.  2  TLM  mesh  and  knife  edge  location 


The  expression  (4)  leads  to  a  Z-matrix  representation  of  the  equivalent  circuit.  Since  the  voltages  and 
currents  at  the  three  ports  are  related  to  the  electric  field  and  to  the  current  density  (5),  the  imped- 
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ance  elements  depend  on  the  Green’s  function  as  given  in  (6). 


27cp. 
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Z  - 

z.  -  7- 


U 


h.j  =  o 


=  -nAlG  (p.,  (j).;p'^,  (?'.}  =  G  (p.,  <t)7;p'^.,  (!)’p;  i,J  =  1,2,3 
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Due  to  the  reciprocity  of  the  Green’s  function  and  to  the  geometrical  symmetry  of  the  problem  there  are 
only  four  distinct  elements  Zy.  The  general  form  of  the  impedance  matrix  is: 


[Z] 


-n  ^12  ^13 
^12  ^22  ^12 
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=  jay[Z] 


where  the  elements  Zij  are  frequency  independent. 

A  more  general  definition  for  the  impedances  is  given  by: 


(7) 


=  w  J  1  (8) 

‘  JWiWj 

where  G(s;so)  represents  the  Green’s  function  determined  in  (6),  and  Wj,Wj  are  the  domains  of  integra¬ 
tion  for  the  source  variables  and  for  the  field  variables.  Three  different  domains  of  integration  have 
been  taken  into  account  (Fig.  3),  leading  to  different  formulations  for  the  impedance  elements. 


a)  Pointform  Integration  b)  Linear  Integration  c)  Superficial  Integration 


In  the  case  of  pointform  integration,  expression  (8)  presents  a  singular  behavior  for  the  main  diagonal 
impedances  Z^,  corresponding  to  coincident  source  and  field  points.  To  determine  a  finite  value  for 
these  impedances,  it  is  therefore  necessary  to  consider  the  source  point  and  the  field  point  in  slightly 
different  positions;  in  particular,  we  can  assume  that  the  radii  p  and  p’  are  not  equal.  Introducing  this 
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approximation  into  expression  (8)  we  obtain: 


1,  [1+^ 

^■1  - 
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where  the  term  x  represents  the  ratio  between  p’  and  p.  and  is  assumed  to  be  less  than  unity. 

On  the  other  hand,  the  pulses  traveling  along  the  TLM  lines  represent  the  field  distribution  over  the 
entire  cell;  we  can  therefore  assume  either  a  linear  or  superficial  distribution  of  field  and  of  the  sources 
(Fig.  3)  around  the  edge.  In  particular,  we  expect  that  the  superficial  integration  will  yield  the  best 
results,  since  it  better  accounts  for  the  energy  stored  around  the  comer. 

Expression  (8)  has  been  evaluated  for  both  the  linear  and  superficial  integrations  and  has  given  rapidly 
convergent  results  for  all  the  elements  of  the  matrix;  the  expressions  for  the  two  cases  are  shown  in  (10) 
and  (11),  respectively. 


Once  the  Z-matrix  has  been  obtained,  it  is  also  possible  to  extract  a  lumped  element  representation  of 
the  equivalent  circuit  (Fig.  4). 


Ports 


Fig.  4  Equivalent  circuit  for  the  knife  edge 


Discretization  process 

In  order  to  realize  the  equivalent  circuit  in  the  TLM  mesh,  we  need  to  determine  the  relation  between 
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the  incident  and  reflected  voltages  at  the  ports  as  a  function  of  the  Z-matrix  elements  (7).  So  far,  we 
have  obtained  a  complete  description  of  the  three  port  circuit  by  means  of  its  impedance  matrix  Z; ' 


>1 

^  =  ;co[Z]7; 

V  = 

^2 

;  /  = 

^3 

h 

The  vectors  V  and  I  can  be  expressed  in  terms  of  the  incident  and  reflected  TLM  voltages  at  the  edge,  V 
'  and  V^,  according  to  the  relation:  ^ 


(13) 

where  is  the  TLM  link  line  admittance. 

The  frequency  dependance  joD  is  discretized  using  a  bilinear  transformation  [6];  this  scheme  guarantees 
the  stability  of  the  discretized  model.  The  frequency  distortion  introduced  is  irrelevant  for  frequencies 
propagating  with  low  dispersion  error  in  the  TLM  mesh  (X  >  10  •  Al).  Hence: 
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Substituting  relations  (13)  and  (14)  in  equation  (12),  we  obtain  a  recursive  formulation  (15)  characteriz¬ 
ing  the  comer  condition  in  the  TLM  process. 


In  this  expression  and  Vf}  are  the  vectors  of  the  reflected  and  incident  voltages  at  the  discrete  time 
step  k. 

Expression  (15)  states  that  the  reflected  voltages  at  the  edge  depend  simply  on  the  incoming  voltages 
and  on  the  incident  and  reflected  voltages  at  the  previous  time  step.  The  implementation  of  the  equiva¬ 
lent  circuit  in  the  TLM  algorithm  is  extremely  effective  and  requires  that  only  a  few  extra  values  be 
stored. 


Results 

The  method  has  been  tested  by  evaluating  the  first  resonant  frequency  of  a  2D  septum-coupled  cavity 
(Fig.  5).  An  accurate  value  (reference  line)  for  the  resonant  frequency  of  the  cavity  has  been  extracted 
by  Richardson  extrapolation  of  TLM  data  for  increasingly  fine  discretization  (Fig.  6). 

We  have  obtained  different  results  for  the  different  Z-matrices  given  by  equations  (9),  (10)  and  (1 1).  As 
expected,  the  best  results  have  been  obtained  with  the  superficial  integration  (11);  they  lead  to  a  much 
faster  convergence  to  the  accurate  values  than  those  obtained  with  the  basic  2D-TLM  algorithm. 
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Fig.  5  Septum-coupled  cavity 


Fig.  6  Comparison  between  the  first  resonant  frequencies  obtained  with  the  standard  TLM 
and  different  corner  corrections 

Even  with  a  very  coarse  mesh  (Al=l  mm)  the  superficial  comer  correction  yields  an  improvement  in  the 
accuracy,  reducing  the  percentage  error  from  more  than  2%  to  0.6%. 

If  an  accuracy  of  0.1%  is  required,  the  comer  correction  allows  to  use  a  cell  size  at  least  four  times 
larger  than  that  necessary  with  the  standard  TLM.  Therefore,  when  high  accuracy  is  desired,  the  corner 
correction  reduces  considerably  the  computational  effort. 


Conclusion 

In  this  paper  a  local  modification  of  the  2D-TLM  algorithm  for  the  nodes  surrounding  a  knife  edge  has 
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been  proposed.  The  Green’s  function  for  a  semi-infinite  knife  edge  has  been  evaluated  under  a  quasi¬ 
static  approximation  and  used  to  derive  an  equivalent  circuit  of  the  comer,  in  the  form  of  a  three-port 
device.Three  possible  definitions  for  the  Z-matrix  of  the  circuit  have  been  considered,  corresponding  to 
different  domains  of  integration  for  the  source  and  field  coordinates. 

The  method  has  been  tested  by  determining  the  first  resonant  frequency  of  a  septum-coupled  cavity 
and,  as  expected,  has  yielded  rapidly  convergent  results  in  the  case  of  superficial  integration. 

The  improved  description  of  the  singular  behavior  of  the  field  around  the  edge  allows  considerable  sav¬ 
ings  in  computer  time  and  memory  requirements,  since  the  desired  accuracy  can  be  achieved  by  using  a 
coarser  discretization. 
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ABSTRACT 

The  Australian  Army  Technical  and  Engineering  Agency  (ATEA)  developed  its  AutoNEC  capability  as  a 
convenient  means  of  creating  and  editing  models  to  be  run  in  the  Numerical  Electromagnetics  Code  (NEC), 
and  for  examining  the  results.  It  comprises  a  number  of  procedures  which  interface  the  drafting  package 
AutoCAD  to  NEC.  During  the  last  year,  AutoNEC  has  been  extended  to  interface  to  3D  Studio,  a  rendering 
and  animation  program.  This  has  resulted  in  a  considerably  enhanced  electromagnetic  visualisation  capability. 
This  paper  describes  some  of  the  aspects  of  AutoNEC,  its  continuing  development  and  the  use  of  3D  Studio. 

BACKGROUND 

Computing  power  and  speed  have  increased  considerably  over  the  last  decade.  So  to  have  the  capabilities  of 
computer  programs  to  model  the  complexities  of  electromagnetic  phenomena  associated  with  radiating  and 
radiated  structures.  The  electromagnetics  worker  can  now  choose  from  a  wider  variety  of  more  readily 
available  and  sophisticated  tools,  and  work  in  greater  detail  than  ever  before.  An  inevitable  consequence  of 
such  advancements  is  the  difficulty  in  interpreting,  in  a  global  sense,  the  vast  amounts  of  data  which  can  be 
generated.  The  extension  of  ATEA’s  AutoNEC  capability  to  interface  to  the  rendering  and  animation  package 
3D  Studio,  was  undertaken  to  address  this  problem,  and  has  demonstrated  considerable  potential  in  the  art  of 
electromagnetic  visualisation. 

Antenna  radiation  patterns  have  traditionally  been  displayed  as  graphs  showing  cross  sections  of  what  is 
essentially  a  three  dimensional  structure  -  theta,  phi  and  antenna  gain. 


Isometric 


Figure  1:  Plan,  elevation  and  isometric  views  of  antenna  gain  (far  field)  pattern. 
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The  preceding  images  show  antenna  radiation  patterns  for  an  Army  field  antenna  for  HF  communications.  The 
patterns  are  for  one  frequency  and  were  drawn  in  AutoCAD,  using  AutoNEC.  The  plan  view  gives  sorne  idea  of 
the  directional  performance  of  the  antenna,  in  this  case  at  a  takeoff  angle  of  five  degrees.  The  elevation  shows 
the  variation  of  gain  with  takeoff  angle,  an  important  factor  for  skywave  communications.  The  isometric  view 
gives  a  more  global  impression  of  the  pattern.  Understanding  the  overall  pattern,  particularly  where  its  shape  is 
complex,  usually  requires  generation  and  comprehension  of  a  multitude  of  such  graphs.  Consideration  of 
polarisation  and  phase  add  further  complexity  as  additional  data  dimensions  become  involved. 

Near  field  data  is  of  importance  to  radiation  hazard  (RADHAZ)  studies.  In  its  simplest  case  this  data  is  a  four 
dimensional  structure  (x.y.z  and  field  magnitude)  and  the  difficulties  of  meaningful  display  are  correspondingly 


greater. 


Figure  2:  Cross-section  of  near 
field  data.  This  horizontal  cross 
section  shows  RADHAZ  profiles  for 
personnel  and  explosives.  The  subject 
is  an  Army  vehicle  (Land  Rover)  fitted 
with  a  high  frequency  (HF)  transmitter, 
with  its  whip  antenna  in  the  upper  left 
location  (dark  area).  Even  with  this 
crude  resolution,  each  cross  section 
contains  some  4500  separate  data 
points.  Examination  of  up  to  thirty 
cross  sections  taken  at  different 
heights  is  necessary  to  adequately 
investigate  hazardous  areas  in  three 
dimensions.  With  28  steps  across  the 
HF  band  (2  to  30  MHZ),  the  total  data 
amounts  to  some  4,000,000  points. 
The  NEC  output  files  for  this 
investigation  before  processing 
amounted  to  over  150  Mbyte. 


There  will  always  be  a  need  for  tabular  representation  of  field  data  to  obtain  values  at  specific  points.  The 
masochist  may  still  be  prepared  to  wade  through  many  pages  of  printout  in  an  attempt  to  gam  an  overall 
appreciation  of  the  results.  A  number  of  approaches  to  displaying  this  data  in  a  meaningful  graphical  form 
have  already  been  developed,  and  met  with  varying  degrees  of  success.  However,  recent  advances  in 
rendering  and  animation  software  have  opened  the  door  to  another  dimension  in  electromagnetic  visualisation. 
This  paper  deals  predominantly  with  AutoNEC  and  the  application  of  rendering  and  animation  capabilities  of  3D 
Studio  to  the  visualisation  of  far  field  radiation  patterns. 


For  several  years  ATEA  has  been  developing  and  enhancing  its  AutoNEC  capability.  Running  on  a  personal 
computer,  it  uses  a  customised  AutoCAD  Version  12  to  generate  three  dimensional  models  for  analysis, 
perform  model  integrity  checks,  and  to  examine  the  results.  Although  AutoNEC  uses  NEC-2D  as  the 
electromagnetics  engine,  there  is  no  reason  why  the  techniques  developed  could  not  be  adapted  to  later 
versions  of  NEC  as  well  as  other  modelling  tools.  The  customising  software  developed  runs  both  within  and 
outside  AutoCAD  and  is  constantly  evolving  as  new  and  different  types  of  analysis  problems  are  encountered. 
A  paper  on  AutoNEC,  as  it  stood  in  1994,  was  presented  at  the  ACES  Tenth  Annual  Review  of  Progress  of  that 
year. 

Commercial  rendering  and  animation  packages  such  as  3D  Studio  have  little  inherent  capability  that  might  be 
exploited  to  directly  investigate  electromagnetic  phenomena,  and  although  AutoCAD  has  some  rendering 
capability,  the  exporting  of  data  to  3D  Studio  provides  a  vastly  improved  capability.  The  rendered  and 
animated’  images  so  produced  provide  dramatically  realistic  images  and  sequences  of  electromagnetic 
phenomena. 
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AutoCAD  ENHANCEMENTS 


AutoCAD  is  a  drafting  package  with  an  uncommitted  architecture.  User-written  menus,  dialogue  boxes,  drafting 
and  other  processes  and  even  special  drawing  entities  can  be  developed  to  provide  user-specific  capabilities. 
User  processes  embedded  in  AutoCAD  can  be  written  in  the  languages  of  AutoLISP,  as  well  as  C,  and  can 
easily  be  incorporated  as  extensions  of  AutoCAD's  already  extensive  capability. 

AutoCAD’s  Extended  Entity  Data  is  a  means  of  attaching  non-drafting  parameters  to  drawing  entities.  Wire 
diameters,  excitations  and  loading  are  attached  as  extended  data  to  the  lines  representing  the  wires  of  the 
model,  thus  creating  drawing  entities  that  closely  mimic  the  corresponding  entries  In  the  NEC  input  file.  Taking 
this  capability  further,  special  drawing  entities  have  been  developed  to  define  networks,  transmission  lines  and 
run  time  parameters.  Observation  points  for  near  and  far  field  runs  are  input  as  special  drawing  entities  which 
geometrically  represent  the  observation  point  sets,  ground  parameters  are  stored  as  extended  entity  data  on  a 
3D  face  physically  representing  the  ground  location,  and  frequency  data  is  similarly  input  as  a  ‘drawing’  entity. 
Thus  the  drawing  can  contain  not  only  the  geometry  of  the  model,  but  also  its  electrical  and  run-time 
parameters. 

In  order  to  accurately  represent  structures  close  to  a  finite  ground,  NEC  uses  the  Sommerfeld/Norton  method 
for  ground  modelling.  Although  NEC  can  model  using  surfaces  patches,  the  Sommerfeld/Norton  method  is 
only  available  for  wire  models.  Since  the  Australian  Army  is  predominantly  concerned  with  equipment  near  the 
ground,  AutoNEC  development  has  concerned  itself  primarily  with  wire  models 

Once  the  model  is  drawn  in  AutoCAD,  and  the  electrical  and  run-time  parameters  are  added,  these  are  output 
as  a  text  file  essentially  in  the  form  of  a  NEC  input  file.  Because  of  the  limitations  in  AutoLISP’s  formatting 
capabilities,  file  formatting  and  NEC  pre-processing  is  done  outside  AutoCAD  using  procedures  written  in 
Pascal.  In  addition  to  the  problem  definition,  the  output  file  also  contains  the  first  entries  in  an  audit  trail  which 
is  extended  as  processing  proceeds. 

NEC  PRE-PROCESSING 

The  NEC  pre-processor  creates  the  NEC  input  files  from  the  AutoCAD  output  files.  As  well  as  format 
adjustment,  model  integrity  checks  are  performed.  Duplicated  or  zero  length  wires  resulting  from  inaccurate 
drafting  are  removed,  and  wire  ends  within  a  specified  window  size  can  be  merged.  Some  dubious  modelling 
strategies  can  be  flagged  and  the  resulting  files  read  back  into  AutoCAD  for  further  examination.  As  the  files 
are  in  a  text  format,  they  can  also  be  edited  with  a  text  editor  -  at  the  user's  risk. 

Figure  3:  Wire  mode  of  Helicopter 

This  1500-wire  model  was  received  as 
25  pages  of  10  columns  of  data.  It 
contained  some  800  geometric  errors 
and  not  surprisingly,  it  would  not  run. 
Software  was  written  to  check  model 
integrity,  to  locate  the  errors  and  correct 
them.  These  procedures  are  being 
extended  so  as  to  automatically 
generate  a  critique  of  the  model,  flag 
dubious  model  structures,  suggest 
possible  improvements,  as  well  as  to 
note  violations  of  the  NEC  modelling 
guidelines. 
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The  NEC  capability  to  perform  multiple  passes  with  frequency  stepping  is  not  used  in  AutoNEC.  If  multiple 
frequencies  are  required,  a  separate  input  file  is  created  for  each  frequency,  together  with  batch  file  to  run  the 
whole  job.  Because  the  files  for  each  frequency  are  separate,  wire  segmentation  can  be  optimised  as  a 
function  of  frequency,  and  this  is  performed  automatically  when  the  files  are  created.  This  results  in  a  faster 
turn-around  of  jobs  covering  a  wide  frequency  range,  and  saves  the  user  from  determining  the  segmentation 
himself.  Similarly,  the  diameters  of  wires  representing  the  edges  of  surfaces  are  automatically  computed  at 
this  point  in  accordance  with  NEC  guidelines.  Another  advantage  of  the  multi-file  approach  is  that  it  minimises 
the  havoc  resulting  from  NEC  hanging  or  a  power  failure,  particularly  where  jobs  run  for  extended  periods. 

NEC  PROCESSING 

Before  the  batch  file  passes  an  input  file  to  NEC,  it  calls  a  timer  program  which  provides  the  user  with  an 
estimation  of  the  completion  time  of  the  current  NEC  run.  This  is  based  on  the  time  history  of  previous  NEC 
runs  within  the  current  batch  file. 

The  program  SOMNEC  (provided  as  part  of  the  NEC  codes)  is  used  to  create  the  interpolation  tables  used  by 
NEC-2  for  Sommerf eld/Norton  ground  modelling  (not  required  in  NEC-4),  if  the  Sommerfeld/Norton  method  is 
requested,  before  running  NEC  the  batch  file  calls  a  program  which  extracts  the  ground  parameters  from  the 
NEC  input  file,  creates  a  SOMNEC  input  file,  and  runs  SOMNEC  to  generate  the  interpolation  table.  This  saves 
the  user  from  having  to  create  his  own  SOMNEC  input  file  for  each  run. 

NEC  POST-PROCESSING 

NEC  output  files  contain  a  significant  amount  of  data  other  than  direct  results.  This  data  should  be  examined  as 
it  can  reveal  an  ill  conditioned  model,  and  checking  for  negative  radiated  power  and  antenna  efficiency  are 
performed  by  AutoNEC.  To  reduce  file  size,  a  significant  amount  of  filtering  is  applied.  Data  extraction  filters 
have  been  developed  to  cover  far  field  (radiation  patterns)  as  well  as  near  field  runs.  The  resultant  files  contain 
the  essence  of  the  results  over  the  whole  of  the  frequency  range  investigated,  as  well  as  the  audit  Uail,  and  is  in 
a  form  that  can  be  printed  for  detailed  examination.  Although  there  is  no  geometry  data  iti  this  file,  it  can  be 
recovered  if  necessary  by  reference  to  the  audit  trail,  which  points  to  the  AutoCAD  drawing  file  where  the  model 
was  created. 

RADIATION  PATTERN  GENERATION 

AutoNEC  reads  the  radiation  pattern  data  files  into  AutoCAD  to  create  a  series  of  three  dimensional  antenna 
gain  surfaces,  each  surface  being  automatically  assigned  to  a  separate  drawing  layer.  The  initial  wire  model 
can  also  be  imported  to  show  pattern  alignment.  While  near  field  data  correlates  exactly  in  orientation  and 
scale  with  the  wire  model,  radiation  pattern  data  is  represented  as  a  surface  of  antenna  gain,  specified  over  a 
range  of  spatial  angles,  and  although  there  is  correlation  with  orientation,  there  is  no  scale  relationship  between 
the  model  and  the  antenna  gain  surface.  Thus  when  the  drawing  of  the  model  is  overlaid  on  a  series  of  antenna 
gain  surfaces,  its  scale  can  be  adjusted  with  impunity  to  assist  visualisation  of  pattern  orientation.  As  the 
patterns  are  created  on  different  layers,  these  can  be  displayed  or  hidden  by  the  user  to  show  images  of  single 
radiation  patterns.  Using  standard  AutoCAD  techniques,  these  images  can  be  stored  as  slides  and  a  simple 
animation  created  and  run  within  AutoCAD.  Rendered  views  as  well  as  mesh  representations  of  the  surfaces 
can  be  created. 

RENDERING  AND  ANIMATION  WITH  3D  STUDIO 

3D  Studio  is  one  of  several  commercial  programs  used  to  create  rendered  images  and  animations  of 
geometries.  Used  extensively  In  the  architectural  and  advertising  industries,  it  interfaces  with  AutoCAD  using 
DXF  (Drawing  exchange  Format)  files  to  both  import  and  export  geometries  and  paths.  3D  Studio  cannot 
create  a  renderable  object  using  the  wire  drawing  defining  the  model  for  NEC  as  it  treats  lines  as  paths  rather 
than  surface  edges.  Thus,  if  the  model  is  to  be  imported  into  3D  Studio,  a  3D  FACE  version  of  it  must  be 
created  in  AutoCAD.  This  generally  means  overlaying  the  existing  wire  model  with  a  series  of  3D  FACES. 
Code  is  being  developed  that  will  generate  this  automatically  from  the  wire  model. 
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3D  Studio  EDITORS 


3D  Studio  contains  five  different  editors  which  are  of  interest  in  the  production  of  animations  of  radiation 
patterns: 

2D  Editor  -  This  is  used  for  creation  of  two  dimensional  objects,  and  is  used  in  AutoNEC  for  generation  of  text 
for  annotation  of  images. 

Lofter  -  This  editor  takes  a  two  dimensional  shape  and  lofts  (extrudes)  it  along  a  path.  Thus  text  can  be  given 
depth  to  enhance  its  appearance. 

3D  editor  -  This  editor  reads  the  AutoCAD-generated  DXF  files  containing  the  radiation  pattern  data.  It  is  also 
where  scene  (the  view  of  the  objects)  is  created.  Lights,  cameras,  backgrounds  and  other  scenic  parameters 
are  initially  defined  here,  and  materials  (colours  and  textures)  are  assigned  to  the  various  objects  in  the  scene. 
If  the  model  geometry  is  simple,  it  can  often  be  drawn  directly  in  this  editor.  Alternatively,  if  a  suitable  3D 
FACE  model  has  been  created  in  AutoCAD,  it  can  be  imported  from  a  DXF  file.  Textual  data  created  in  the  2D 
Editor  or  Lofter  is  transferred  to  annotate  the  scene.  This  editor  can  create  still  image  files  in  a  variety  of 
standard  formats  including  GIF,  TIG,  TARGA,  JPEG,  These  are  suitable  for  printing,  and  for  inclusion  into 
presentation  packages  such  as  Powerpoint,  as  well  as  word  processors. 

Materials  Editor  -  This  editor  provides  a  means  of  creation  and  modification  of  materials  to  be  assigned  to 
objects  in  the  scene.  3D  Studio  Version  4  comes  with  some  183  materials  in  its  standard  library  and  any  of 
these  can  be  modified  in  the  materials  editor.  Alternatively  the  user  can  create  his  own  materials.  A  wide 
range  of  material  parameters  can  be  adjusted  including  texture,  transparency,  bump  and  reflection  maps, 
shininess,  ambient,  diffuse  and  specular  colour,  and  shading  methods.  Of  the  standard  materials  provided, 
Blue  Marble  has  been  found  to  be  the  most  useful  for  radiation  pattern  depiction.  When  properly  attached  to  a 
radiation  pattern  geometry,  the  rendered  image  has  a  series  of  lines  radiating  from  the  centre  of  the  pattern 
which  enhance  its  visual  interpretation.  Alternatively  the  user  may  decide  to  create  a  library  of  materials  more 
suited  to  his  needs,  and  a  material  specifically  for  radiation  pattern  display  is  being  developed  at  ATEA. 

Keyframer  -  This  editor  allows  the  various  steps  in  an  animation  sequence  to  be  set  up.  Of  particular  interest 
is  the  morphing  capability.  Having  imported  (say)  twenty  eight  radiation  pattern  geometries  covering  the  HF 
band,  the  lowest  frequency  pattern  is  morphed  in  subsequent  image  frames  so  that  it  adopts  the  shape  of  the 
patterns  at  other  frequencies.  If  morphing  is  done  between  other  than  successive  frames,  3D  Studio  performs  a 
spline  interpolation  between  successive  shapes  to  generate  intermediate  images.  As  well  as  creating 
animation  files,  the  keyframer  can  create  series  of  still  images  in  any  of  the  formats  available  in  the  3D  editor, 
file  names  being  derived  from  the  source  file  name  and  fhe  frame  number. 

GENERATING  ANIMATIONS 

The  generation  of  good  quality  animations  requires  a  considerable  amount  of  effort,  patience  and  experience. 
Unless  the  user  is  familiar  with  both  AutoCAD  and  3D  Studio,  a  great  variety  of  errors  and  frustrations  await  the 
unwary.  Drawing  in  three  dimensions  is  an  art  in  itself.  Unlike  AutoCAD,  3D  Studio  does  not  yet  support  a 
language  that  will  allow  the  user  to  create  his  own  procedures,  Many  similar  and  repetitive  steps  are  taken 
during  the  creation  of  each  animation.  Performing  these  under  program  control  rather  than  manually  would 
considerably  reduce  the  effort  required,  as  well  as  the  number  of  mistakes  that  occur  along  the  way.  It  is  hoped 
that  some  programmability  will  be  built  into  future  versions  of  3D  Studio. 

One  of  the  prime  requirements  for  animation  creation  is  a  fast  computer,  with  significant  amounts  of  RAM  and 
file  storage.  ATEA  is  currently  using  a  90  Mhz  Pentium  machine,  with  64  Mbyte  of  RAM,  2  Gbyte  of  disk  and 
DAT  tape  backup.  Even  with  this  speed,  some  animations  have  required  over  ten  minutes  per  frame,  and  a 
week  to  create  a  60  Mbyte  animation  file.  With  this  sort  of  investment  in  processing  time,  rework  must  be 
avoided  by  careful  checks  made  at  strategic  points  during  the  creation  process. 
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In  broad  terms,  the  following  steps  are  typical  of  those  performed  in  creating  a  simple  animation  of  radiation 
patterns: 

1 .  Create  model  in  AutoNEC  and  generate  NEC  input  files. 

2  Run  NEC  and  extract  the  data  from  the  NEC  output  files. 

3  Create  a  series  of  3D  Surfaces  in  AutoCAD,  representing  the  radiation  patterns. 

4’  Transfer  the  surfaces  to  3D  Studio  using  a  DXF  file.  *  ■ 

5  In  the  3D  Editor,  create  the  scene  for  the  pattern  (lights,  cameras,  background,  matenals,  text  etc) 

6*  In  the  Keyframe’r  hide  all  but  one  (the  first)  radiation  pattern  surface. 

7  Morph  this  pattern  to  the  other  patterns  at  successive  frames. 

8.  Run  the  Tenderer  to  create  the  animation. 

The  process  is  basically  simple.  Familiarity  with  the  multitude  of  scenic  variables  that  can  be  used  in  scene 
creation  is  essential  to  creating  good  quality  animations  and  images,  A  sense  of  the  theatrical,  and 
knowledge  of  lighting  and  camera  techniques  is  also  of  considerable  use,  particularly  when  preparing  training 
material  where  the  most  dramatic  images  tend  to  leave  the  long-lasting  impressions. 

PLANNED  ACTIVITIES 

The  Australian  Army’s  current  single  channel  radio  system  (RAVEN)  includes  a  f  bS® 

user  presently  relies  on  limited  radiation  pattern  data  in  the  printed  documentation  as  his  basic 

communications  training.  ATEA  is  preparing  training  aids  to  remedy  this 
compression  techniques,  almost  50,000  images  of  antenna  patterns  can  be  stored  ^ 

(CD)^  Using  a  mouse  to  adjust  frequency,  viewing  angle  and  other  parameters,  the  aids  will  provide  an 

interactive  visualisation  of  radiation  patterns. 

Because  of  the  large  distances  involved,  the  Australian  Army  makes  extensive  use  of  HF 

and  data  in  its  operations  in  the  north  west  of  the  continent.  The  ionospheric  prediction  Program  ASA 

(Advanced  Stand  Alone  Prediction  System),  is  similar  to  the  US  IRONCAP,  and  is  used  to 

frequency  allocations  under  the  prevailing  ionospheric  conditions.  Antennas  characterised  n 

ASAPS  do  not  well  match  those  in  service,  and  preparation  of  more  appropriate  antenna  ^ 

way.  initial  issues  of  these  files  have  already  demonstrated  improvements  in  HF  communications  availability. 

Other  work  includes  a  study  of  the  design  and  performance  of  aircraft  HF  antennas.  Particularly  ^^ose  mounted 
on  helicopters  -  one  of  the  worst  possible  platforms  for  an  HF  antenna.  This  work 
investigations  into  different  and  hopefully  improved  antenna  configurations,  but  will  provide  pattern 
visualisation  to  assist  pilots  to  a  better  understanding  of  their  HF  communications  anomalies. 

Some  work  has  been  done  on  visualising  near  field  data.  Early  attempts  to  display  this  four-  dimensional  data 
used  a  series  of  cross-sections,  (see  figure  2)  and  were  of  some  use.  However  a  significant  number  of  such 
cross-sections  were  needed  for  visualisation  of  the  whole  field  structure  in  three  dimensions.  VVork  with  3D 
studio  using  semi-transparent  materials  (glasses)  to  create  three  dimensional  imaps  of  near  shown 

some  promise.  The  materials  represent  three  dimensional  contour  surfaces,  like  the  °™°".  fS 

transparent  and  of  different  colours,  the  internal  structure  can  be  seen.  A  method  of  transforming  near  field 
data  to  the  form  needed  by  3D  Studio  is  yet  to  be  devised. 

VALIDATION 

There  is  a  continuing  need  to  improve  methods  of  integrity  and  robustness  checking  of  models.  Asse^rnent  of 
resute  agaS  measurements  are  an  essential  part  of  mis  process.  Validation  data  are  scrut.ni^^^^^^^^ 
find  weaknesses  in  apparent^  good  models  and  extend  modelling 

documentation.  Validation  of  near  field  data  againsf  actual  m^suremenl  of  fte  Land  Rove 

RADHAZ  investigation,  and  some  modifications  were  made  to  the  initial  model.  Good  (i.e.  within  a  few  db) 
correlation  is  now  being  obtained.  Checks  against  antenna  impedance  and  coupling  also  show  promise  of  a 
robust  model  Validation  of  far  field  data  is  a  different  proposition  as  measurements  require  large  physical 
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separation  in  three  dimensions  between  the  source  and  the  measurement  points  and  present  considerable 
logistic  problems.  Scale  models  are  a  possible  alternative  and  1/10  scale  model  of  Black  Hawk  fuselage  has 
been  constructed  for  this  purpose.  Results  from  this  work  should  not  only  enhance  confidence  in  the  computer 
modelling,  but  also  lead  to  improvements  in  the  NEC  pre-processor  in  flagging  dubious  model  constructions. 
Thus  a  continuous  improvement  in  models  will  be  achieved,  without  imposing  any  significant  overhead  on  the 
modeller. 

create  the  images  in  this  paper  and  in  the  projected  images  accompanying  the  presentation 
although  not  specifically  validated,  is  generated  from  robust  models  of  antennas,  and  are  thus  believed  to  be 
accurate.  Such  accuracy,  however,  is  not  fundamental  to  the  thmst  of  this  paper,  which  is  to  demonstrate  the 
potential  of  animation  packages  such  as  3D  Studio  in  the  field  of  electromagnetic  visualisation. 

QUIRKS 

One  initially  puzzling  aspect  of  morphing  in  3D  Studio  was  that  the  radiation  pattern  moved  during  the 
morphing,  even  though  all  the  patterns  from  which  the  sequence  was  created  had  their  origins  fixed  at  0,0,0.  It 
was  eventually  found  that  the  3D  Studio  morphing  algorithm  maintains  the  geometric  centre  of  the  morphed 
object  at  the  geometric  centre  of  the  Initial  object.  Thus  morphing  from  a  initial  pattern  with  little  vertical 
development  to  one  with  significant  vertical  development  would  cause  the  origin  of  the  resultant  patterns  to 
move  downwards.  Initially,  the  positions  of  the  patterns  were  hand  corrected  in  the  keyframer  to  maintain  a 
fixed  origin,  requiring  successive  frame  by  frame  adjustments  in  three  directions  -  a  tedious  and  exacting 
process.  Once  the  reason  for  this  behaviour  had  been  identified,  the  radiation  pattern  creation  procedure  in 
AutoCAD  was  modified  to  generate  a  path  derived  from  the  geometric  centres  of  the  pattern  shapes.  This  path 
wasjhen  used  in  the  keyframer  to  adjust  the  geometric  centres  of  the  morphed  objects  and  maintain  the  origin 

Another  curious  behaviour  was  found  when  using  the  entity  selection  process  in  AutoCAD.  When  the  selection 
process  is  invoked  during  DXF  file  creation,  AutoCAD  Version  12  apparently  modifies  the  order  effaces.  When 
3D  Studio  morphs  between  objects  across  multiple  frames,  it  creates  intermediate  objects  by  geometric 
interpolation,  thus  the  face  order  is  important.  Objects  (field  patterns)  which  appeared  quite  normal  when 
imported  into  3D  Studio,  showed  gross  distortions  during  interpolation  as  a  result  of  face  order  inconsistencies. 
The  solution  was  not  to  use  entity  selection,  even  though  this  results  in  larger  DXF  files. 

IMAGES 

The  following  images  are  necessarily  printed  in  grey  scales.  When  reproduced  in  full  colour  by  3D  Studio  the 
visualisation  of  the  Images  is  dramatically  enhanced.  Animation  further  adds  to  the  realism. 


Figure  4:  Result  of  AutoCAD 
entity  selection  process. 

3D  Studio  has  created  the  centre 
shape  by  interpolation  between 

the  left  and  right  radiation 

patterns.  The  selection  process 

within  AutoCAD  has  not 

preserved  the  order  of  faces 

between  the  left  and  right 

patterns,  resulting  in  the  gross 

distortions  during  morphing,  as 

shown  in  the  centre  shape. 
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Figure  5:  Rendered  image  of  far  field  data.  This  image  is  one  of  many  thousands  produced  for  the 
bidirectional  delta  antenna  used  in  the  field  by  the  Australian  Army.  The  suite  of  images  will  be  stored  on  a 
compact  disk  for  display  (in  colour)  as  a  training  aid  for  signals  personnel. 

CONCLUSIONS 

The  development  of  the  techniques  described  in  this  paper  have  already  demonstrated  significant  potential  in 
visualisation  of  electromagnetic  phenomena.  The  pilot  training  aid  as  well  as  computer-based  and  video-tape 
animation  displays  have  aroused  considerable  interest  within  Army  and  other  areas  within  the  Australian 
Department  of  Defence  as  well  as  overseas.  The  visualisation  techniques  are  not  limited  to  electromagnetics, 
but  could  be  used  to  display  virtually  any  static  or  varying  phenomena,  provided  that  an  adequate  mathematical 
description  is  available  which  will  allow  the  shape(s)  of  the  phenomena  to  be  created  within  AutoCAD. 

It  is  expected  that  the  electromagnetic  visualisation  provided  by  the  antenna  training  aids  will  significantly 
contribute  to  better  field  communications.  Their  use  stimulates  an  ‘antenna  intuition’  previously  resulting  only 
from  long  years  of  field  experience  and  generally  found  only  amongst  long  retired  communications  personnel. 


333 


Performance  of  Multiple,  Thin  Layers  of  Lossy  Dielectrics  as  Broadband  Attenuators 


George  W.  Jarriel  Jr.,  Michael  E.  Baginski,  Lloyd  Riggs 
Department  of  Electrical  Engineering 
200  Broun  Hall  •  Auburn  University,  Alabama  36849 

January  9,  1996 


Abstract 

In  the  past,  single  layers  of  radar  absorbers  have  been  used  to  attenuate  unwanted  reflections  due  to 
an  incident  electromagnetic  wave  on  a  conductor  over  a  narrow  range  of  frequencies.  However,  recent 
advances  in  materials  sciences  have  made  the  use  of  multiple,  thin  layers  of  lossy  materials,  each  with 
different  electrical  constituitive  paramters,  to  create  broad-band  radar  absorbers  feasible.  Properly 
applied,  this  would  ideally  lead  to  a  significant  increase  in  performance  over  single  layer  absorbers. 

This  paper  focuses  on  a  numerical  model  for  determining  the  attenuation  of  unwanted  reflections 
due  to  multiple  layers  of  radar  absorbers  coated  on  a  perfectly  electric  conductor. 


I  Introduction 

The  suppression  of  unwanted  reflections  off  metal  structures  is  often  an  important  consideration  when 
introducing  a  transmitter/reciever  set  into  a  system.  Reflections  from  other  antennas,  metal  buildings, 
and  nearby  structures  can  impair  the  performance,  and  even  damage  a  radar  system.  Possible  methods 
for  reducing  reflections  include  altering  the  geometry  of  the  system,  distancing  the  transmitter/reciever 
from  the  reflecting  surface,  and  coating  the  reflector  with  a  radar  absorbing  materials. 

In  order  to  effectively  implement  a  RAM  solution  to  reflection  problems,  care  must  be  taken  in  order  to 
determine  the  proper  material  for  the  job.  Unfortunately,  the  performance  of  many  RAM  are  dependent 
on  the  geometry  of  the  structure  they  are  applied  to,  as  well  as  the  frequency  of  the  incident  energy, 
material  constituitive  parameters  and  RAM  depth.  Complex  geometry  is  diflicult  to  simulate 

alone,  and  extremely  computationally  intensive  when  a  lossy  coating  is  applied. 

A  simpler  geometry,  the  flat  plate  coated  with  RAM  is  simple  to  calculate,  requires  minimal  com¬ 
putational  resources,  but  only  gives  the  investigator  raw  RAM  performance  under  ideal  circumstances. 
This  says  very  little  about  the  materials  performance  on  more  complex  geometries. 

A  infinitely  long,  right  circular  cylinder  could  be  considered  a  compromise  geometry.  Using  a  numer¬ 
ically  exact  infinite  series  method  to  calculate  the  fields  due  to  a  coated  cylinder  requires  a  negligible 
amount  of  CPU  time,  and  at  the  same  time  gives  a  greater  sense  of  scale  to  the  investigator,  allowing 
for  inferences  to  possible  behavior  on  more  complex  geometries. 

This  paper  is  concerned  with  presenting  a  method  for  the  prediction  of  absorber  performance  for 
materials  with  graded  constituitive  parameters.  This  model  will  then  be  used  to  calculate  the  radar  cross 
section,  RCS,  of  three  different  material  parameter  profiles.  First  as  a  single  homogeneous  layer,  secondly 
as  multiple  thin  layers  where  the  constituitive  parameters  are  graded  linearly  as  shown  in  Figure  1,  and 
finally  multiple  thin  layers  where  the  constituitive  parameters  are  graded  exponentially.  These  thin  layers 
can  be  taylored  with  a  specific  set  of  desired  constituitive  parameters  by  methods  such  as  suspending  an 
lossy  material  in  powder  form  in  a  matrix  (such  as  a  polymer)  [1]. 
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Figure  1:  Profile  of  constituitive  parameters  for  linearly  graded,  exponentially  graded,  and  homogeneous 
dielectrics  analyzed. 

While  more  difficult  to  fabricate,  materials  with  continuously  graded  constituitive  parameters  can  be 
used,  if  properly  designed,  to  attenuate  a  greater  range  of  frequencies  than  a  single  layer  of  homogeneous 

RAM.  .  .  •  ^ 

Due  to  the  large  number  of  variables  involved,  this  paper  wiU  focus  on  concentric  coatings  where 
the  grading  of  the  constituitive  parameters  is  dependent  only  on  the  distance  from  the  center  of  the 
PEC  cylinder,  p.  Oblique  incidence,  non-circular,  symetric  cylinders,  and  non-concentric  coatings  can 
be  investigated  using  the  same  methods  described  here,  but  are  beyond  the  scope  of  this  paper. 

II  Analytic  Formulation 

Shown  in  Figure  2  is  a  TM*  electric  field  incident  on  an  infinitely  long  (no  z-variation)  wire  with  N 
concentric,  dielectic  coatings  (regions  1  to  N)  where  each  region  has  constituitive  parameters  and 

k^).  From  Harrington  [2]  we  know  that  the  total  electric  field  for  the  N+lth  region  (free  space)  is 
defined  as 

-t-oo 

=  eU+i  +  Kn+1  =  Eo  r^{Jr,{kN+ip)  +  anHi^\kN+ip))e^^  (1) 

n=-oo 

and  for  regions  m<N,  the  total  electric  field  is  defined  as 

-1-00 

-b  =  Eo  X)  r^^iKrnMkrnP)  +  C^,rnNi^\krnP))e^^'^  (2) 

n=— oo 

where  the  subscript  m  denotes  the  region,  E^  is  the  z-component  of  the  total  electric  field,  E^  and 
E'^  are  the  incident  and  scattered  z-directed  fields  respectively,  and  Eq  is  the  magnitude  of  the  incident 
field.  Jn  is  the  bessel  function  of  the  first  kind,  JV„  is  the  Neuman  function,  and  represents  the 
hankel  function  of  the  second  kind.  An,  fen.m,  and  Cn.m  are  scattering  coefficients  modifying  the  bessel 
functions.  Solving  for  the  coefficients  The  boundary  conditions  at  each  interface  scattering  coefficients 
are  determined  by  applying. 

Region  m{P  —  P'i-)  ~ 

E*. flexion  miP  =  Pm+l)  =  K^egion  m+liP  =  Pm+l)  for  all  Pm  2<m<N-bl  (4) 
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Figure  2:  Geometry  of  Problem.  Circular  PEC  cylinder  coated  with  N  layers  of  dielectic  material. 


miP  ~  Pm+l)  —  Region  m+liP  ~  Pm+l)  fo?  Pm  2<m<N+l 
Application  of  Maxwell’s  equations  and  these  boundry  conditions  to  1  and  2  yield 

a  =  -  I  -  Jh{kN+ipif+i)Dn,N+l  ^ 


if  m=l 
otherwise 


TP  —  _  r  ’^TijkmPm)  Jni^rnPm)Dn,m  \ 

[N^ik^p^)-KikmPm)Dn,m) 

In  order  to  determine  a^,  must  be  determined.  Once  found,  can  then  be  used  to  solve  for  D„_2 
which  is  used  to  solve  for  F„,2-  This  process  is  repeated  until  Dn,N+i  is  determined,  at  which  point  On 
can  be  calculated.  Once  the  coefficient  o„  has  been  determined,  the  scattered  field,  E|,  is  known,  and 
so  the  RCS  can  be  calculated. 

A  similar  method  to  that  outlined  above  may  be  employed  to  determine  scattered  fields  when  the 
RAM  coated  wire  is  illuminated  by  a  TE  plane  wave. 

The  formula  for  the  RCS  is  defined  as 


RCS  =  lim  2Trp- — ~  (9) 

E*  is  usually  to  be  unity,  and  furthermore,  in  the  rar  field  asymptotic  formula  for  the  hankie  function 
of  the  second  kind  reduces  the  formula  to  a  readily  calculable  form. 
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For  initial  code  verification,  the  formulas  presented  here  were  used  to  calculate  the  RCS  for  various 
cases  with  a  lossless  coating.  Results  for  the  lossless  coating  were  in  excellent  agreement  with  RCS 
predictions  computed  using  other  methods  [5]  [6]. 

To  test  possible  errors  introduced  due  to  the  large  number  of  layers  surrounding  the  cylinder  necessary 
to  simulate  continuous  media,  the  RCS  of  a  cylinder  was  calculated  when  coated  with  sixty  thin  layers  of 
free  space  {fir  =  l,er  =  !)•  The  results,  shown  in  figure  3,  were  compared  with  a  known  solution  [4],  and 
showed  excellent  agreement.  This  simulation  confirms  that  the  series  of  bessel  functions  is  an  adequate 
approximation,  provided  that  orders  up  to  10  are  included. 


Figure  3:  Comparison  of  RCS  of  bare  cylinder  for  known  solution  (-)  and  RCS  of  cylinder  surrounded 
by  60  thin  layers  of  free-space  (o-o). 

The  number  of  shells  required  to  adequately  simulate  a  continuously  graded  media  is  dependent  on 
two  factors,  maximum  frequency  simulated  and  peak  constituitive  parameters  of  the  absorber.  Shown  in 
figure  4  is  the  RCS  for  a  coated  cylinder,  where  a  continuous,  exponentially  graded  absorber  is  simulated 
using  N  layers.  As  the  number  of  layers  increase,  the  solution  converges.  However,  it  can  be  seen  from 
figure  4  the  solutions  agree  initially  at  lower  frequecies  for  a  smaller  number  thin  layers.  Therefore,  if 
the  frequency  range  of  interest  were  to  be  expanded,  the  number  of  layers  required  to  effectively  simulate 
the  continuous  material  would  also  have  to  be  increased.  Increasing  the  peak  values  for  the  constituitive 
parameters  would  also  require  an  increase  in  the  number  of  layers  in  the  model. 


Ill  Numerical  Results 

Shown  in  figure  5  is  an  example  of  typical  performance  due  to  exponentially  and  linearly  graded  ab¬ 
sorbers.  The  PEC  cyUnder  is  2  cm  in  diameter  with  a  2  mm  layer  of  material.  The  RAM  has  a  complex 
permitivity  of  9.511-10.8679  and  a  complex  permeability  of  0.9511-10.09.  Both  graded  materials  were 
simulated  using  60  thin  layers  of  equal  radius.  The  geometry  is  illuminated  by  a  normally  incident  TM 

Both  graded  materials  give  attenuation  over  a  broad  range,  but  the  linearly  graded  material  has 
better  overall  performance.  It  is  important  to  note  at  this  point  that  data  studied  to  date  shows  greater 
performance  from  a  single  homogeneous  layer  with  the  contituitive  paramters  equal  to  the  peak  values  of 
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Figure  4:  RCS  of  a  0.1m  cylinder  coated  with  an  exponentially  graded  absorber  simulated  with  N  layers 
of  homogeneous  material. 


the  graded  materials  for  some  geometries.  This  is  heavily  dependent  on  system  geometry  and  material 
parameters. 

IV  Summary 

A  model  was  presented  for  the  determination  of  the  fields  and  RCS  of  a  right  circular  cylinder  surrounded 
by  a  layer  of  lossy  material  where  the  constituitive  parameters  vary  continuously  with  the  radius,  p.  By- 
separating  the  material  in  multiple  thin  layers  of  homogeneous  material  to  approximate  the  continuous 
material,  an  accurate,  computationally  efficient  model  was  developed. 

Data  was  shown  suggesting  that  a  model  for  using  graded  materials  can  be  useful  in  the  determination 
of  constituitive  parameter  profiles  for  enhanced  attenuation  of  reflections.. 

Areas  if  future  research  include  determining  the  bistatic  RCS  of  graded  RAM. 
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REF 


Figure  1.  REF  CEM  Data  Dictionary 


BACKGROUND 

The  Research  and  Engineering 
Framework  (REF)  manages  data  for  a  variety 
of  design  and  analysis  codes.  Each  of  these 
different  design  and  analysis  codes  has  its  own 
data  requirements  (data  model)  for  input  and 
output.  The  method  that  the  REF  uses  to 
handle  these  different  data  models  is  through 
the  use  of  a  data  dictionary.  By  utilizing  the 
data  dictionary  REF  users  will  have  the  ability 
to  switch  between  data  models  thereby 
transforming  the  generic  REF  into  a  discipline- 
specific  design  system. 

This  paper  discusses  the  ongoing  effort 
to  design  and  develop  a  Computational 
Electromagnetic  (CEM)  specific  data 


dictionary  for  the  REF  as  shown  in  Figure  1.  Types 
of  data  covered  by  this  dictionary  will  include:  CEM 
analysis  techniques;  geometry  elements;  material 
properties;  excitation  information;  and  observables 
Information.  The  CEM  data  dictionary  is  being 
developed  as  a  baseline  data  dictionary  for  the  wide 
variety  of  CEM  codes  available.  A  developer  need 
only  tie  his/her  application-specific  code  into  the 
REF  directly  or  through  wrappering  to  access  this 
data  dictionary'. 

Figure  2  shows  a  fundamental  structure  of 
the  ty'pe  of  CEM  data  base  we  have  developed.  This 
data  base  layout  has  as  its  "leaves",  or  lowest  nodes, 
a  CEM  description  of  a  physical  object  for  analysis. 

The  Data  Dictionary  design  is  being 
developed  to  be  used  as  a  template  to  enable 
existing,  or  legacy,  CEM  tools  to  be  interfaced  with 
the  REF  as  shown  in  Figure  3.  These  codes  as  well 
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Figure  2.  CEM  Database  for  the  REF 
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Figure  3.  CEM  Code  Interface  with  the  REF 


as  new  CEM  tools 
will  be  able  to  be 
fully  integrated  into 
the  REF.  This  is 
possible  because  of 
the  supporting  set  of 
development  APIs 
(Application 
Programmer 
Interfaces)  in  the 
REF.  These  APIs 
allow  other  existing 
codes  to  be  modified 
to  access  the 
databases  in  the  REF. 
There  are  also  APIs 
that  provide  the 
developer  with  a 
robust  set  of  code 


that  allows  programs  to  perform  graphics,  gridding,  etc.  These  APIs  allow  the  developer  to  concentrate 
on  the  EM  d^evdopment  mther  than  worrying  how  he/she  is  going  to  store  and  visualize  the  data.  Figure 
4  shows  how  the  codes  will  be  integrated  with  the  REF. 


DATA  DICTIONARY  DESCRIPTION 

A  data  dictionary  is  a  description  of  all  the  databases  composing  a  data  base  system.  The  content 
of  the  data  dictionary  may  best  te  thought  of  as  "data  about  the  data"-that  is  descnpuons  of  all  of  the 
other  objects  (e.g.,  flies,  progratus,  and  so  on)  in  the  system.  In  particular,  a  data  dictionary  stores  all  the 
various  Schemas  and  file  specifications  and  their  locations.  The  dictionary  is  consulted  before  actual 
data  is  read  or  modified  in  the  data  base  system.  A  complete  data  dictionary  also  includes  information 


about  which  programs 
use  which  data  and 
which  users  are 
interested  in  which 
reports.  The  data 
dictionary  is  frequently 
integrated  into  the 
system  it  describes. 

A  relational 
data  base  system  needs 
to  maintain  data  about 
the  relations  among  the 
data.  Among  the  types 
of  information  the  data 
dictionary  stores  on  the 
system  are; 

•  Names  of 


Figure  4.  CEM  Code  Integration  with  the  REF 


the  relations 

•  Names  of  the  attributes  of  each  relation 


•  Domains  of  attributes  _ 

•  Names  of  views  defined  on  the  data  base,  and  the  definition  of  those  views 

•  Integrity  constraints  for  each  relation  (e.g.,  key  constraints) 
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CEM  DATA  DICTIONARY  DESIGN  CONSIDERATIONS 


When  developing  a  data  base  for  tools  that  exist  and  for  codes  that  haven't  been  written  yet  there 
are  a  number  of  considerations  to  examine.  For  this  effort  there  are  three  general  considerations: 

1 .  Do  we  design  a  data  base  that  incorporates  all  of  the  specific  data  types  that  are  used  by  CEM 
codes  today  which  may  limit  our  expansion  capability? 

2  .  Do  we  design  a  data  base  that  uses  only  generic  entities  that  answer  the  need  of  a  wide 

variety  CEM  codes  and  will  allow  future  codes  to  be  incorporated  easily?  But  current  codes 
will  require  additional  coding  to  incorporate  them  into  this  framework. 

3  .  Do  we  approach  the  design  of  the  data  base  as  a  combination  of  the  best  of  both  of  the  first 

two  options? 

The  first  option  above  at  the  outset  appears  to  be  the  most  logical  step  when  designing  the  data 
base.  All  one  needs  to  do  is  to  survey  the  CEM  community  and  get  a  complete  inventory  on  the  types  of 
data  that  are  required  for  each  code  and  then  build  an  appropriate  description  (table)  to  hold  the  data. 
However,  this  approach  brings  with  it  two  major  concerns.  The  first  is  that  the  volume  of  data  types  will 
soon  become  unmanageable,  and  the  second  is  that  many  of  the  data  elements  will  be  found  to  have 
common  areas  covered  by  other  data  elements  causing  redundancy  in  the  data  base  which  is 
uiidesireable. 

The  second  of  the  two  approaches  provides  the  most  flexibility  in  designing  the  data  base.  The 
CEM  data  entities  can  be  modeled  such  that  the  layout  of  the  data  base  is  extremely  flexible.  For 
example,  instead  of  separate  entities  for  patches,  plates,  and  facets  a  new  surface  area  element  could  be 
designed  such  that  any  of  the  other  entities  could  be  generated  from  it.  Also  an  element  like  the  surface 
area  could  be  used  along  with  the  gridding  interface  in  the  REF  to  generate  a  MoM  wire  mesh. 
Therefore,  with  this  approach  the  amount  of  data  that  would  need  to  be  stored  would  be  greatly  reduced. 

There  are  two  concerns  with  this  approach  however,  the  first  being  that  any  existing  code  would 
have  to  either  be  modified  extensively  or  a  very  robust  wrapper  would  need  to  be  generated  to  translate 
these  new  generic  entities  into  the  more  traditional  CEM  entities.  The  second  concern  with  the  general 
approach  is  that  the  contents  of  the  data  base  may  be  less  intuitive  to  a  user  who  is  browsing  through  a 
group  of  models.  The  surface  area  means  little  unless  the  user  knows  a  priori  that  this  is  a  MoM  wire 
grid  model  and  the  surface  will  be  subdivided  by  a  wire  gridding  algorithm. 

The  last  approach  would  utilize  the  best  of  both  the  specific  cases  of  CEM  data  types  and  the 
more  general  approach.  By  designing  a  data  base  that  incorporates  existing  data  types  we  have  a  very 
intuitive  data  base  that  can  be  used  by  today’s  codes.  Also  having  the  general  data  types  like  the  surface 
area  entity  future  codes  can  be  written  to  take  advantage  of  this  data  abstraction  and  utilize  the  power  of 
the  utilities  and  libraries  in  the  REF. 

CEM  DATA  DICTIONARY 

The  CEM  data  dictionary  is  being  developed  such  that  the  data  is  logically  grouped  into 
functional  areas.  The  specific  areas  that  were  addressed  are  Input  Commands,  Input  Geometry,  and 
Output  Commands.  Each  of  these  areas  was  broken  down  further  into  more  manageble  subareas  that 
were  then  used  to  create  the  tables  and  data  elements  in  the  tables.  Figures  5  through  10  show  the  tables 
that  were  constructed  for  the  CEM  data  dictionary  and  their  relationships  with  each  other. 

The  trouble  with  creating  a  CEM  data  base  is  ensuring  that  it  is  generic  for  any  of  the  existing 
codes  and  flexible  enough  to  be  easily  expanded  to  incorporate  new  advances  in  CEM.  In  creating  a 
generic  CEM  data  base  some  CEM  code  specific  constructs  are  not  available.  This  is  because  these 
constructs  would  have  little  or  no  meaning  to  another  code.  An  example  of  one  of  these  constructs  is  a 
geometry  reflection  command  that  is  allowed  by  codes  like  NEC  MoM  and  GEMACS.  Other  codes  like 
CARLOS  3D  don’t  recognize  a  reflection  command  and  therefore  an  analysis  would  potentially  produce 
invalid  results.  Two  possible  solutions  to  this  problem  include  storing  the  complete  geometry  after  the 
reflection  command  has  been  executed  or  by  making  a  note  in  a  comments  section  that  the  geometry  that 
is  stored  will  need  to  be  reflected  in  some  manner  prior  to  its  use.  Both  of  these  solutions  are  possible 
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with  the  data 

dictionary  of  this 
effort.  The  former 
would  be  handled  by  a 
preprocessor  that 
would  create  the 

correct  geometry, 
whereas  the  latter 
solution  is  handled  by 
a  comments  entry  in 
the  geometry  table 
described  later. 

Figure  5 
show's  the  Master 

Table  and  how  it 
provides  access  to  the 

other  tables  in  the  data  base.  In  the  data  base  the  individual  analyses  are  uniquely  identified  by  their 
AnalysisJD  in  the  Master  Table.  There  are  other  columns  in  the  table  where  an  analysis  title  can  be 
added,  and  comments  about  the  analysis  can  be  added.  The  other  columns  in  the  table  are  IDs  that  point 
to  other  tables  in  the  data  base  where  the  remainder  of  the  analysis  information  is  stored.  These  other 
tables  include  the  Sources  Table,  the  Fields  Table,  the  Geometry  Region  Table  and  the  Frequency  Table. 
Each  of  these  tables  are  described  in  the  following  paragraphs  and  figures. 

The  Sources  Table  and  its  related  tables,  shown  in  Figure  6,  are  used  to  store  the  information 
necessary  to  create  an 
excitation  command 
for  the  CEM  analysis. 

There  are  three  source 
types  defined  in  the 
data  base.  The  Pattern 
Source  table  stores 
information  to  be  able 
to  use  previously 
generated  field  data 
(measured  or 

simulated)  as  the 
excitation  source.  The 
Electric  Field  table  is 
used  to  store 
information  to  create  a 
fixed  or  stiff  source  at 
a  given  location  and 
orientation.  The 
Impressed  Source 
Table  stores 

information  to  create  an  excitation  that  can  be  applied  to  some  portion  of  the  structure  such  as  a  voltage 
on  a  wire  or  patch.  In  all  three  of  these  tables  there  are  coordinate  system  parameters  that  allow  for 
placement,  polarization  and,  if  necessary,  orientation  of  the  source. 

The  Assoc_Sources  Table  provides  the  capability  of  storing  more  than  one  source  per  analysis. 
An  example  of  an  analysis  requiring  multiple  sources  would  be  the  simulation  of  an  array  antenna  where 
each  of  the  individual  elements  in  the  array  is  represented  by  an  Electric  Field  source. 

Similar  to  the  set  of  tables  for  source  is  the  Fields  Table,  Figure  7,  and  its  related  tables.  These 


Figure  6.  CEM  Data  Dictionary  Sources  Table  and  Relations 


Figure  5.  CEM  Data  Dictionary  Master  Table  and  Relations 


343 


Figure  7.  CEM  Data  Dictionary  Fields  Table  and  Relations 


tables  store  the 
information  necessary 
to  create  the  command 
to  identify  where  field 
data  are  to  be 
generated  in  the 
analysis.  The  results 
of  the  field  command, 
the  solution  data, 
would  be  stored  in 
another  data  base. 

Two  types  of 
field  command 

information  can  be 
stored  in  the  data  base. 
Electric  Field  and 
Magnetic  Field.  As 
with  the  sources  the 


field  tables  store  information  on  the  location  and  coverage  of  the  requested  fields.  Also  multiple  field 
commands  can  be  stored  via  the  Assoc  Fields  Table. 


Scalar 
frequencies  or 

frequency  ranges  are 
stored  in  the 
Frequency  Table, 
Figure  8.  The 
Assoc_Frequency 
Table  allows  multiple 
scalar  frequencies  or 
ranees  to  be  stored 


with  a  single  analysis. 

Nonuniformly  spaced  scalar  frequency  values  could  be  stored  in  this  manner. 


The  Geometry  Region  tables,  Figure  9,  and  relations  provide  storage  for  the  structure  under 

analysis.  The  storage  of 
multiple  regions  allows  the 


user  to  build  the  model  in 
reusable  sections  such  as 
wings,  fuselage,  stabilizers, 
etc.  It  also  allows  for  multi- 
region  (i.e.,  external  vs. 
internal)  analyses  to  be  stored 
for  analysis  by  one  code  or  a 
set  of  codes. 

The  Geometry  Table 
and  its  related  tables,  shown  in 
Figure  10,  stores  the  low  level 
geometric  description  of  the 
structure.  This  description  is 


Figure  9.  CEM  Data  Dictionary  Geometry  Region  Table  and 
Relations 


in  CEM  model  terms,  i.e., 
wires,  plates,  patches, 
cylinders,  etc,  to  facilitate  the 
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“Geometry” 


ease  of  interfacing 
existing  CEM  codes 
with  the  REF.  As 
these  codes  are 
integrated  into  the 
REF  to  utilize  more  of 
the  REF  APIs,  it  is 
envisioned  that  the 
geometry  tables  will 
progress  toward  one  of 
the  standard  CAD  data 
formats  like  IGES. 

SAMPLE 

DATABASE 

A  sample 
CEM  data  base  was 
constructed  using  the 


,  ,  „  ,  Microsoft  Access 

Figure  10.  CEM  Data  Dictionary  Geometry  Table  and  Relations  Relational  Database 

Management  System. 

illhe  d*:  b°a™a nd  'hreSaCtag  the  data  from  .he  data  base  for  use  an  analys.s. 


SUMMARY 

Common  areas 
Structural  and 

Thermal 
investigations  are  also 
being  considered  to 
facilitate  transfer  of 
information  and  the 
future  development  of 

discipline-specific 

REF  data  dictionaries. 
These  areas  are  of 
immediate  interest  due 
to  their  interaction 
with  electromagnetics. 

The  REF  data 
dictionary  and  its 
support  software  were 
developed  to  provide  a 
means  for  validating 


of  CEM,  Computation  Fluid  Dynamics,  and  Finite  Eiemen.  Analysis  for 


Figure  11.  CEM  Data  Dictionary  Support  Code 


application  data  input  to  a  wide  variety  of  engineering  disciplines.  It  is 

andfacil,a.e  ex.end,ng  .he  dt^a  mod=^a«d  d,«^^^^^^ 


345 


REFERENCES 


Burke,  G.  J.,  Poggio,  A.  J.,  "Numerical  Electromagnetic  Code  (NEC)  -  Method  of  Moments,"  Part  III: 
User's  Guide,  Lawrence  Livermore  National  Laboratory,  prepared  under  contract  N-7405-Eng- 
48,  January  1981. 

Coffey.  E.L.,  D.L.  Kadlec,  "General  Electromagnetic  Model  for  the  Analysis  of  Complex  Systems 
(GEMACS),  Version  5.0",  Advanced  Electromagnetics,  RADC-TR-90-360,  Volumes  I,  II  and 
HI.  Dec  1990. 

Coffey,  E.L.,  D.L.  Kadlec,  "GEMACS  Source  Book,  Advanced  Electromagnetics",  RADC-TR-87-102, 
February  1988. 

Date,  C.J.,  "An  Introduction  to  Database  Systems  Volume  1,  Fourth  Edition",  Addison-Wesley 
Publishing  Company,  Reading  MA,  1986. 

Evans,  J.  A.,  “Research  and  Engineering  Framework  Data  Dictionary  Specification  for  Computational 
Electromagnetics”,  Applied  Computational  Electromagnetics  Society  11th  Annual  Review  of 
Progress  in  Applied  Electromagnetics,  p  126,  Mar  1995. 

Korth,  H.F.  and  A.  Silberschatz,  Database  Systems  Concepts,  McGraw  Hill  Book  Company,  New  York, 
1986. 

Lee.  S.  W..  D.  J.  Andersh,  "Users  Manual  for  XPATCH",  DEMACO,  Inc.  Sep,  1993. 

Marhefka,  R.  J.,  Burnside,  W.  D.,  "Numerical  Electromagnetic  Code  (NEC)  -  Basic  Scattering  Code 
(Version  2),"  Part  1;  User’s  Manual,  Technical  Report  712242-14,  The  Ohio  State  Universit>' 
ElectroScience  Laboratory,  Columbus;  prepared  under  Contract  N00123-79-C-1469  for  Naval 
Regional  Contracting  Office,  December  1982. 


346 


DEVELOPMENT  OF  AN  ELECTROMAGNETIC  AND  MECHANICAL  SIMULATION 
TOOL  FOR  THE  COMPUTER  MODELING  OF  THE  TACAMO  LFA^LF 
COMMUNICATION  SYSTEM* 

Mary  C.  Longtin,  Richard  W.  Sutton,  Kenneth  J.  Laskey  and  Philip  J.  Morrison 
Science  Applications  International  Corporation 
1710  Goodridge  Dr. 

McLean,  Virginia  22102 


ABSTRACT 

Over  the  past  several  years,  the  Naval  Air  Systems  Command  has 
sponsored  the  development,  at  SAIC,  of  a  computer  simulation  tool 
for  the  dynamic  electromagnetic  and  mechanical  modeling  of  the 
TACAMO  LFA^LF  communications  system.  This  paper  will  discuss 
the  development  of  that  simulation  tool,  especially  the  use  of  the  code 
NEC  (Numerical  Electromagnetics  Code)  in  that  tool,  along  with 
numerical  and  parameter  issues  that  arose  as  a  part  of  the  develop¬ 
ment. 

INTRODUCTION 

The  TACAMO  LF A^LF  communications  system  includes  two  wires  trailing  from  an  aircraft. 
The  longer  wire  of  the  two  can  vary  in  length  from  about  15,000  feet  to  26,000  feet,  and  the  shorter 
wire  can  vary  in  length  from  about  1,600  feet  to  4,000  feet.  The  wires  are  driven  by  a  power 
amplifier  on  the  aircraft  at  frequencies  from  about  17  to  60  KHz. 

Over  the  past  several  years,  the  Naval  Air  Systems  Command  (NAVAIR)  has  sponsored  the 
development  at  SAIC  of  the  TACAMO  Electromagnetic  Simulation  System  (TESS),  a  computer 
simulation  tool  for  the  dynamic  electromagnetic  and  mechanical  modeling  of  the  TACAMO  system. 
The  TESS  system  enables  a  user  to  predict  the  signal  strength  and  signal-to-noise  ratio  of  the 
TACAMO  transmission  at  locations  distant  from  the  TACAMO  transmitter. 

The  TACAMO  Electromagnetic  Simulation  System  contains  three  parts  or  separate  codes. 
The  first  part  is  a  dynamic  mechanical  model  of  the  long  and  short  trailing  wire  antennas  (LTWA 
and  STWA).  This  part  calculates  the  wire  shape  and  position  as  a  function  of  the  flight  path  of  the 
aircraft,  wind  conditions,  wire  length  and  other  variables.  The  second  part  consists  of  a  NEC 
calculation  of  the  current  distribution  on  the  LTWA  and  the  STWA  and  the  electric  field  radiated 
from  the  antennas  based  on  the  wire  shapes  that  are  output  from  the  mechanical  model.  The  third 
part  consists  of  a  calculation  of  the  signal  propagating  from  the  transmitter,  based  on  the  result  of 
the  NEC  calculation.  In  the  current  phase  of  this  project,  the  NEC  output  is  used  to  develop  an 


‘This  work  was  performed  for  the  Naval  Air  Systems  Command  under  contract  number 
N00421-91-D0206. 


347 


equivalent  dipole  to  the  TACAMO  system,  that  is,  an  elemental  dipole  whose  radiated  electric  fields 
would  best  fit  the  calculated  electric  fields  radiated  by  the  TACAMO  transmitter.  This  equivalent 
dipole  is  an  input  for  the  propagation  segment.  Another  input  for  the  propagation  segment  of  TESS 
is  the  total  power  emitted  by  the  TACAMO  antennas.  This  total  power  is  also  obtained  from  the 
NEC  output  file.  All  three  segments  of  TESS  are  controlled  by  a  user-fnendly  GUI  interface.  The 
sequence  of  calculations  in  TESS  is  shown  in  Figure  1. 

Currently,  we  are  using  either  SIMBAL  (Simulation  for  Multiple  Bursts  And  Links)  or 
WEDCOM  (Weapon  Effects  on  D-region  COMmunications)  as  the  part  of  TESS  that  describes  the 
propagation  of  the  TACAMO  signal  to  the  receiver  site.  Both  codes  describe  the  propagation  of  a 
signal  over  long  distances,  taking  into  account  the  earth's  curvature,  the  ionosphere,  signal  sources 
due  to  storms,  etc.  The  choice  between  these  two  codes  is  based  on  factors  such  as  accuracy  and  run¬ 
time  requirements.  For  example,  SIMBAL  runs  much  faster  for  a  given  problem  than  WEDCOM 
does,  since  SIMBAL  uses  look-up  tables  while  WEDCOM  performs  the  electromagnetic  calculation. 
Therefore,  a  user  that  requires  a  faster  calculation  will  opt  for  using  SIMBAL  for  the  propagation 
part  of  TESS.  The  version  of  NEC  that  we  are  currently  using  is  NEC4. 

The  GUI  interface  automatically  generates  the  NEC  input  file  from  the  output  of  the  dynamic 
mechanical  model,  and  the  SIMBAL  input  file  from  the  NEC  output  file.  For  example,  the  user 
chooses  a  simulation  time  and  some  flight  parameters  and  conditions,  the  mechanical  model 
calculates  the  TACAMO  wire  shape  at  that  simulation  time,  and  the  user  interface  generates  an 
approximation  to  that  wire  shape  using  straight  wires  and  converts  that  approximation  into  the 
correct  format  for  a  NEC  input  file.  Likewise,  the  GUI  interface  calculates  the  total  power  emitted 
from  the  TACAMO  antenna  and  the  equivalent  dipole  to  the  antenna,  as  discussed  earlier,  and 
converts  those  results  to  the  correct  format  for  a  SIMBAL  or  WEDCOM  input  file. 

DETAILS  OF  THE  NEC  MODEL 

Figure  2  shows  a  schematic  of  the  NEC  model  of  the  TACAMO  system.  The  TACAMO  long 
md  short  wires  are  modeled  by  connected  series  of  straight  wire  sections.  The  aircraft  is  simulated 
in  the  model  by  a  "wire"  with  a  diameter  such  that  the  capacitance  of  the  resulting  cylinder  to  ground 
is  approximately  equal  to  the  capacitance  of  the  aircraft  to  ground.  The  wire  series  representing  the 
long  and  short  TACAMO  wires  meet  at  a  point  on  the  aircraft.  A  voltage  is  driven  between  two 
segments  at  that  point. 

The  accuracy  of  the  end  result  of  a  TESS  calculation,  the  electric  field  some  distance  from 
the  transmitter,  depends  greatly  on  how  accurately  TESS  can  calculate  the  total  emitted  power  and 
the  equivalent  dipole.  The  total  power  emitted  from  the  TACAMO  antenna  is  calculated  by 
integrating  the  Poynting  vector  over  a  surface  containing  the  antenna.  The  emitted  power  and  the 
equivalent  dipole,  then,  depend  in  turn  on  how  accurately  NEC  can  calculate  the  current  on  the 
TACAMO  wires  and  the  electric  fields  radiated  from  the  wires.  Therefore,  the  NEC  model  of  the 
TACAMO  antenna  is  important  for  the  accuracy  of  the  TESS  calculation.  The  focus  of  this  paper 
will  be  on  the  work  that  has  been  done  to  refine  the  NEC  model  with  respect  to  the  parameters  that 
define  the  model. 

Note  that  since  the  NEC  portion  of  TESS  only  needs  to  calculate  the  electric  fields  near  the 
antenna  in  order  to  enable  TESS  to  calculate  the  power  emitted  from  the  antenna,  and  the 
propagation  code  calculation  deals  with  propagation  of  the  fields  over  long  distances,  the  results  of 
the  NEC  calculation  only  need  to  be  considered  in  the  near  field  of  the  antenna.  Therefore,  the 
effects  of  parameter  variations  only  need  to  be  considered  on  the  near  field,  not  the  far  field,  results. 
For  example,  one  can  include  a  model  of  ionospheric  reflections  in  the  NEC  model  and  study  the 
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variations  in  the  NEC  results  with  variations  in  the  ionosphere  height,  but  it  turns  out  that  including 
the  ionosphere  is  only  important  for  results  in  the  far  field  of  the  antenna,  and  so  is  not  considered 
as  a  factor  in  the  NEC  portion  of  TESS.  •  w 

The  following  parameters  define  the  TAC  AMO  NEC  model;  the  ntunber  of  wires  to  simulate 
the  long  and  short  TACAMO  antennas,  the  number  of  segments  per  wire,  the  frequency  of  the 
transmission,  the  wire  resistivity  and  the  ground  plane  dielectric  constant  and  ^onductivity^Some 
of  these  parameters  are  very  accurately  known  or  can  be  very  accurately  measured  by  the  TAC  AMO 
system  and  so  we  do  not  need  to  consider  the  effects  on  the  accuracy  of  the  TESS  calculation  of 
variations  in  these  parameters.  The  transmitter  frequency  and  the  wire  resistivity  are  two  such 
parameters  The  TACAMO  transmitter  controls  the  transmitting  frequency  to  a  high  degree  ot 
^curacy,  and  the  physical  characteristics  of  the  TACAMO  wires  are  very  well  known.  On  the  o±er 
hand,  the  ground  plane  characteristics  may  not  be  precisely  known  in  advance,  so  we  needed  to 
determine  if  a  variation  in  these  characteristics  had  a  significant  effect  on  the  results  of  the  NEC 
calculation.  The  best  number  of  wires  per  TACAMO  antenna  and  number  of  segments  per  wire  in 
the  NEC  model  also  needed  to  be  explored.  The  rule  of  thumb  is  that  the  segment  length  must  be  less 
than  a  tenth  of  a  wavelength  for  the  frequencies  of  interest*,  but  we  have  found  further  constraints 
on  the  segment  length  for  this  problem. 

PHYSICAL  WIRE  LENGTHS  VS  EM  WIRE  LENGTHS  AND  SEGMENTATION 

In  this  paper,  we  use  the  following  terminology  in  order  to  be  consistent  wth  the  NEC 
manual-  The  actual  TACAMO  trailing  wire  antennas  are  referred  to  as  "antennas",  the  largest 
subdivisions  of  the  antennas  are  referred  to  as  "wires"  or  "straight  wires",  and  the  smallest 
subdivisions  of  the  antennas  are  referred  to  as  "segments" . 

Both  the  physical  model  and  the  NEC  model  of  the  TACAMO  system  are  based  on 
approximating  the  TACAMO  antennas  by  connected  series  of  shorter,  straight  wires,  and  in  both 
calculations,  the  accuracy  of  the  calculation  and  the  run  times  increase  with  the  nuinber  of  divisions 
into  which  the  TACAMO  antennas  are  divided.  Due  to  the  way  the  GUI  interface  handles  the 
division  of  the  antennas,  there  must  be  at  least  as  many  straight  wires  in  the  NEC  model  as  there  are 

in  the  mechanical  model.  i.  •  i  j  xinr’ 

Figure  3  shows  the  relative  accuracy  when  the  number  of  wires  in  the  mechanical  and  NEC 

models  of  the  TACAMO  antenna  are  varied.  Each  data  point  refers  to  a  mechanical  calculadon  and 
a  NEC  calculation  in  series,  where  the  NEC  calculation  is  based  on  the  output  wire  shape  from  the 
mechanical  calculation.  The  point  labeled  "Baseline  model"  represents  the  mech^ical  and 
electromagnetic  segmentation  chosen  for  TESS,  based  on  the  run  time/accuracy  trade-offs.  For  fte 
mechanical  calculation,  this  baseline  model  entails  dividing  the  LTWA  into  50  straight  wires  For 
the  NEC  calculations,  this  baseline  model  entails  dividing  the  LTWA  into  10  straight  wires  and  the 
STWA  into  5  wires.  Dividing  the  antennas  more  finely  for  the  mechanical  or  NEC  calculation  causes 
an  increase  in  run-time  without  greatly  increasing  the  accuracy. 

This  leaves  the  question  of  how  to  divide  the  straight  wires  of  the  NEC  model  into  segments 
for  the  NEC  calculation.  As  has  been  mentioned,  the  rule  of  thumb  is  that  the  segment  lengths  inust 
be  less  than  a  tenth  of  a  wavelength  at  the  frequencies  of  interest  for  an  accurate  NEC  calculation 
A  typical  transmission  frequency  for  the  TACAMO  system  is  25  KHz,  which  gives  a  wavelength 


^"Numerical  Electromagnetics  Code  —  NEC-4  User's  Manual",  Gerald  J.  Burke,  LLNL 
Report  Number  UCRL-MA-109338,  page  3,  January,  1992. 
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of  12  km.  By  comparison,  the  long  TACAMO  wire  is  typically  about  5.9  km.  Dividing  the  long 
TACAMO  wire  into  ten  straight  wires  gives  a  wire  length  of  about  .05  wavelengths,  so  by  the 
previously  mentioned  "rule  of  thumb"  we  would  not  expect  to  need  more  than  one  segment  per 
straight  wire. 

However,  Figure  4  shows  results  from  a  representative  calculation  that  indicates  the 
sensitivity  of  the  NEC  model  to  the  number  of  segments  near  the  drive  point.  This  figure  shows  the 
polar  component  of  the  electric  field  from  the  TACAMO  system  as  a  function  of  the  polar  angle.  The 
numbers  of  straight  wires  and  segments  in  the  models  are  indicated  in  the  legend.  For  example  the 
curve  labeled  "16  Away,  18  Near"  refers  to  the  case  where  the  NEC  model  has  16  segments  per  ^re 
on  wires  away  from  the  drive  point  and  18  segments  per  wire  on  wires  near  the  drive  point  As 
expected,  reducing  the  number  of  segments  per  wire  from  six  to  one  does  not  change  the  results  of 
the  calculation  except  when  the  number  of  segments  on  the  wire  closest  to  the  drive  point  is  changed. 
When  the  segments  away  from  the  drive  point  remains  at  one,  but  the  segments  on  the  wire  nearest 
the  drive  point  are  reduced  from  18  to  10,  we  begin  to  see  a  change  in  the  accuracy  of  the  NEC 
calculation.  Based  on  a  series  of  similar  runs  and  on  run-time  considerations,  we  have  chosen  the 
NEC  model  to  use  in  TESS  to  contain  ten  straight  wires  for  the  long  wire  and  five  for  the  short  wire, 
with  ten  segments  per  wire  for  the  wires  nearest  the  drive  point  and  one  segment  per  wire  for  all  the 
other  wires. 

GROUND  PLANE  CHARACTERISTICS 

The  other  parameters  that  were  mentioned  above  as  possibly  having  some  effect  on  the  NEC 
model  results  are  the  dielectric  constant  and  conductivity  of  the  ground  plane  under  the  TACAMO 
system.  We  performed  a  trade  study  to  determine  the  importance  of  changing  these  parameters.  The 
results  of  the  trade  studies  show  that  the  current  distributions  and  the  wires  and  electric  fields  from 
the  wires  are  almost  independent  of  the  ground  plane  characteristics,  where  the  parameters 
characterizing  the  ground  plane  were  varied  from  those  typical  of  ocean  water  to  those  typical  of  dry 
earth.  The  impedance  of  the  TACAMO  wires  were  also  basically  independent  of  these  changes. 
Therefore,  the  power  emitted  from  the  TACAMO  antenna  is  essentially  independent  of  the  ground 
plane  characteristics,  for  a  physically  reasonable  range  of  those  characteristics,  and  the  ground  plane 
dielectric  constant  and  conductivity  are  found  to  be  unimportant  for  the  NEC  part  of  TESS.  The 
physical  reason  for  this  is  that  the  conductivity  of  the  ground  plane  gives  the  effective  position  at 
which  the  boundary  conditions  would  be  applied,  i.e.  a  skin  depth.  This  skin  depth,  however,  is 
negligible  compared  to  the  antenna  height  or  wavelength.  The  NEC  model  does,  however,  require 
a  ground  plane  beneath  the  TACAMO  system,  in  the  location  of  the  ground. 

This  does  not  mean  that  the  characteristics  of  the  ground  over  which  the  radiation  from  the 
TACAMO  antenna  is  propagating  are  unimportant  to  the  end  result  of  the  TESS  calculation.  The 
ground  plane  characteristics  have  a  large  effect  on  the  propagation  part  of  the  calculation.  However 
that  information  is  already  contained  accurately  in  SIMBAL  and  WEDCOM.  Those  characteristics 
do  not  affect  the  near  field  results  of  the  NEC  part  of  TESS. 

Figure  5  shows  an  example  of  results  from  a  full  TESS  calculation.  This  curve  shows  the 
vertical  component  of  the  electric  field  as  a  function  of  distance  from  the  TACAMO  antenna  near 
the  ground  plane,  as  calculated  by  the  full  TESS  code,  with  SIMBAL  as  the  propagation  section  of 
the  code.  Within  500  km  of  the  TACAMO  antenna,  the  vertical  electric  field  falls  off  as  1/r  as 
would  be  expected.  Farther  from  the  TACAMO  antenna,  the  vertical  electric  field  falls  off  more 
sharply,  approximately  as  due  to  the  increased  effects  of  ionosphere  and  ground  plane  losses 
at  larger  distances.  At  the  distance  of  the  "knee",  the  TACAMO  antenna  and  reception  location  are 
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out  of  line-of-sight  of  each  other,  resulting  in  the  increased  importance  of  ionospheric  losses.  This 
full  TESS  calculation  is  in  very  good  agreement  with  data. 

CONCLUSION 

The  current  version  of  the  TESS  system  is  a  useful  tool  for  calculating  the  expected  electric 
fields  long  distances  from  the  TAC  AMO  transmitter.  This  paper  has  shown  results  of  trade  studies 
which  have  determined  the  NEC  model  used  in  TESS.  Future  versions  of  TESS  will  include  detailed, 
automated  graphical  output  and  a  refined  version  of  the  NEC-SIMBAL  interface.  In  the  future 
versions  of  this  interface,  SIMBAL  will  be  modified  to  accept  as  input  from  NEC  the  actual  near- 
field  electric  fields  that  NEC  calculates  based  on  the  model  of  the  TACAMO  transmitter,  rather  than 
the  dipole  approximation  that  the  NEC-SIMBAL  interface  now  uses.  This  should  improve  the 
accuracy  of  the  final  results  of  the  TESS  calculation  even  further. 


Figure  1.  Sequence  of  Calculations  in  the  TACAMO  Electromagnetic  Simulation 
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Figure  2.  NEC  Model  of  the  TACAMO  Antennas 


Long  Wire  Antenna  -  Half 
Wavelength  Long  With  10 
Segments/Wire  Near  the 
Drivepoint  and  SegmentA/Vire 
away  from  the  Drivepoint. 
Radius  .01m  \ 


Airplane,  Long  Wire  and 
Short 

Wire  Electrically 
Connected  AtDrivepoint 


Short  Wire  ~  .045  Wavelengths  Long, 
With  10  Segments/Wire  Near  the 
Drivepoint  and  1  Segment/Wire  away 
from  the  Drivepoint 
Radius  .01m 


Airplane  Modeled  as  a  Fat  Wire 
Segment  with  10  Subsegments 
Radius  30m  (.0025  Wavelengths) 
Length  of  wire  is  about  0.0125 
Wavelengths 


Figure  3.  Dependence  of  Accuracy  and  Run-Time  on 
Segmentation  Methods 


Vertical  electric  field  (mV/m) 


Figure  4.  Sensitivity  of  the  NEC  Model  Results  to 
Number  of  Segments/Wire 


Theta 


Figure  5.  Sample  Results  from  a  TESS  Calculation 
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A  New  Look  at  Antenna  Traps 
Paul  W.  Leonard,  P.  E.  (W7LMX) 

Hatfield  &  Dawson  Consulting  Engineers 
4226  Sixth  Avenue  N.  W. 

Seattle,  WA  98107 

Abstract:  A  trap  monopole  is  described  which  gives  gain  at  the  high-band  frequency  in  trade  for 
a  poor  matching  impedance. 

Introduction:  Trap  antennas  have  been  used  for  a  long  time,  and  are  very  convenient  for 
multiband  antennas  since  they  give  a  good  impedance  on  each  frequency  band.  However,  the 
antenna  is  effectively  one-quarter  wavelength  long  (assuming  a  monopole)  at  each  frequency. 
At  the  higher  frequencies  the  structure  is  long  enough  to  support  a  gain  antenna.  There  is  plenty 
of  precedence  for  this.  One  example  is  known  as  the  "Franklin  Antenna,"  Reference  1.  These 
antennas  depend  upon  a  phase  reversal  at  the  current  node  in  order  the  keep  the  current  in 
phase  along  the  length  of  the  antenna.  Another  example  (Reference  2)  is  call  a  controlled 
current  antenna.  These  designs  are  single  band  antennas.  At  the  low  frequency,  say  40-meters 
(7.24  MHz),  a  short  monopole  with  inductive  loading  will  give  a  good  impedance  and  the  antenna 
gain  is  comparable  to  a  short  dipole.  I  propose  to  put  a  resonant  circuit  approximately  in  the 
center  of  a  25-foot  monopole  which  at  40-meters  (7.24  MHz)  provides  inductive  loading  and  at 
10-meters  (29.0  MHz)  provides  capacitive  reactance  for  gain. 

Test  calculations:  A  twenty-five  foot  monopole  was  modeled  using  our  MININEC  program, 

MBPRO,  Reference  3.  A  capacitive  reactance  was  modeled  at  the  middle  of  the  antenna. 
Several  values  of  capacitance  were  tried  to  determine  the  value  for  maximum  gain  at  29  MHz. 
Figure  1  shows  the  monopole  gain  over  a  short  monopole  versus  the  center  reactance.  This 
shows  that  a  value  between  -700  and  -900  ohms  gives  more  than  2.8  dB  gain.  Now  let’s  look 
at  the  base  impedance.  Figure  2.  This  is  a  poor  impedance  match  to  50-ohms.  However,  the 
new  transceivers  frequently  have  an  antenna  tuner  placed  at  the  base  of  the  monopole  that  is 
capable  of  matching  the  impedances  to  50-ohms,  Therefore,  by  use  of  a  capacitive  reactance 
in  the  center  of  the  antenna  we  can  have  nearly  3-dB  gain. 

At  40-meters  wavelength  the  25-foot  antenna  is  only  66  electrical  degrees  high.  The 
radiation  pattern  is  almost  the  same  as  that  of  a  very  short  monopole.  This  was  verified  with  the 
computer  model  and  is  true  even  if  the  current  along  the  monopole  is  uniform,  the  case  for 
maximum  gain.  Therefore,  all  the  loading  coil  can  do  at  the  40-meter  wavelength  is  improve  the 
base  impedance.  Figure  3  shows  the  base  impedance  versus  loading  coil  reactance.  An 
inductive  reactance  of  290  ohms  gives  a  base  impedance  that  can  be  matched  easily. 

The  trap  impedance  is  defined  as  follows: 

7.24  MHz  +290  ohms 
29.0  MHz  -700  ohms 
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Implementation:  A  shunt  resonant  circuit  consisting  of  a  5.42-microhenry  inductance  in  parallel 
with  a  13.39-  picofarad  capacitor  will  provide  the  above  impedance.  Note  that  this  circuit  has 
a  resonant  frequency  of  18.68  MHz.  There  are  many  ways  to  construct  this  circuit.  The  coil  can 
be  about  10  turns,  one  inch  in  diameter  and  one  inch  long.  The  capacitor  can  be  made  from 
telescoping  the  upper  mast  tube  about  one-half  inch  into  the  lower  tube  with  one-eighth  inch 
thick  insulation.  Another  way  to  construct  a  trap  is  given  in  Reference  4. 

Conclusion:  This  antenna  has  a  reasonable  impedance  at  7.24  MHz,  but  will  need  an  antenna 
tuner  at  the  base  for  29  MHz.  You  will  get  better  than  2.8  dB  gain  over  a  short  monopote  at  the 
29  MHz  frequency.  The  above  frequencies  were  for  demonstration  purposes.  Many  other  pairs 
of  frequencies  can  be  used. 
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GAIN  AT  29  MHz  VS  REACTANCE 
25-FOOT  MONOPOLE  -  REACTANCE  IN  CENTER 


BASE  IMPEDANCE  AT  29  MHz  VS  REACTANCE 
25-FOOT  MONOPOLE  -  REACTANCE  IN  CENTER 


BASE  IMPEDANCE  AT  7.24  MHz  VS  REACTANCE 
25-FOOT  MONOPOLE  -  REACTANCE  IN  CENTER 
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Abstract 

This  paper  describes  a  tomographic  system  which  images  conductive  and  ferromagnetic  materials 
using  a  magnetic  induction  technique.  The  region  of  interest  has  a  circular  shape  and  is  surrounded  by 
16  excitation/detection  coils,  equally  spaced  and  radially  oriented.  Data  acquisition  implies  that  each  coil 
in  turn  is  energised  with  a  100  kHz  sinusoidal  current  while  all  others  are  sensing  the  generated  magnetic 
flux.  A  total  of  240  independent  measurements  (16  projections  times  15  measurements  each)  are  then 
taken  in  about  10  ms.  The  sensor  design  has  been  validated  by  simulation  and  a  close  agreement  was 
found  between  simulated  and  real  data.  Two  different  algorithms  were  used  for  imaging  of  objects.  The 
first  is  a  heuristic  algorithm  based  on  a  priori  knowledge  and  enables  the  detection  of  both  the  location 
and  the  diameter  of  a  rod  of  different  materials  (highly  conductive,  highly  conductive  and  ferromagnetic, 
nomconductive  and  ferromagnetic)  with  a  small  error.  This  algorithm  is  very  efficient  and  works  at 
almost  real  time.  The  second  is  a  ART-type  algorithm  which  uses  the  experimentally  acquired  sensitivity 
maps  to  build  the  projection  matrix.  Presently,  qualitative  images  of  different  combinations  of  highly 
conductive  and  ferromagnetic  objects  placed  within  the  region  of  interest,  are  obtained.  In  a  66  MHz 
486  PC,  5  iterations  of  this  algorithm  take  about  3  seconds. 


1  Introduction 

Interest  on  the  use  of  electrical  tomographic  methods  for  process  applications,  specially  on-line  monitoring  and 
control,  has  been  growing  in  the  past  few  years.  c  o^i 

Electrical  Impedance  Tomography  (EIT),  originally  introduced  for  medical  imaging  [Barber  84,  Seager  87J, 
and  Electrical  Capacitance  Tomography  (ECT)  [Huang  92,  Xie  92]  are  techniques  relatively  well  developed  today. 
Systems  based  on  them  and  targeted  for  specific  applications  are  under  evaluation  [Gutierrez  95,  Remecke  95J. 

Electromagnetic  Tomography  (EMT)  or  Magnetic  Inductance  Tomography,  on  the  other  hand,  is  not  so  well 
understood.  A  few  scanners  based  on  its  principles  have  been  built  [Zeibak  93,  Yu  93,  Yu  94]  and  the  potential  of 
this  imaging  technique  for  processing  applications  is  currently  being  assessed.  General  overviews  may  be  found  m 

[Peyton  95,  Peyton  94].  ,  .  t  •  u  u 

In  this  paper  a  system  to  image  conductive  and  ferromagnetic  materials  is  described.  It  is  shown  how  scan¬ 
ner  design  has  been  validated  through  simulation,  and  results  produced  by  two  different  imaging  algorithms  are 
presented. 


2  Physical  Principle 

Figure  1  describes  in  a  schematic  way  the  physical  principle  of  EMT. 
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Figure  1:  Physical  principle  of  an  EMT  system. 


The  region  of  interest  is  energised  by  an  AC  sinusoidal  magnetic  field  created  by  one  or  more  excitations  coils 
placed  at  its  border.  Magnetic  field  distribution  is,  in  general,  affected  by  its  angular  frequency  (m)  and  by  the 
electrical  conductivity  (cr),  the  electrical  permittivity  (e)  and  the  magnetic  permeability  {fi)  of  the  medium. 

For  media  having  linear  and  isotropic  electrical  and  magnetic  properties,  and  further  assuming  that  a,  e  and  p 
are  constant  throughout  the  different  materials  in  the  region  of  interest,  and  that  the  displacement  currents  may 
be  neglected,  the  Maxwell  equations  in  phasor  notation  yield  [Kraus  92] : 

V  X  V  X  B  =  —jwapi  ■  B  (1) 

V-B  =  0  .  (2) 

The  tomographic  reconstruction  problem  can  then  be  stated  in  the  following  way:  knowing  for  different  ex¬ 
citation  patterns  the  values  of  the  radial  component  of  the  magnetic  flux  density  vector  B  along  the  border  of 
the  region  of  interest  (which  can  be  deduced  from  the  induced  voltage  in  the  detection  coils),  estimate  the  spatial 
distributions  of  the  electrical  conductivity  a  and  the  magnetic  permeability  /r  of  the  materials  that  lay  inside. 


3  Description  of  the  Imaging  System 

Figure  2  presents  a  general  view  of  the  EMT  system  that  was  built.  Its  block  diagram  is  illustrated  in  figure  3 
and,  as  it  can  be  seen,  consists  of  three  main  sub-systems:  the  scanner,  the  coils  interface,  signal  conditioning  and 
data  acquisition  circuitry  and  the  host  computer. 

The  scanner  is  made  of  three  concentric  plastic  pipes.  The  region  of  interest  is  defined  inside  the  innermost  (or 
third)  one  and  has  a  diameter  of  150  mm.  Sixteen  coils,  consisting  of  a  small  winding  on  a  core  mounted  ferrite 
rod,  are  positioned  between  the  second  and  the  third  pipe,  equally  spaced  and  radially  oriented.  Between  the  first 
and  the  second  pipe,  a  ring  shaped  shield,  made  of  compressed  ferrite  powder,  surrounds  the  region  of  interest  in 
order  to  confine  the  magnetic  flux  and,  thus,  increases  sensitivity  and  acts  as  a  screen  against  external  interference. 

The  sixteen  coils  are  wired  in  such  a  way  that  each  can  operate  as  an  emitting  or  receiving  coil.  The  coils 
interface  electronics,  through  computer  control,  defines  the  excitation  pattern.  The  main  function  of  the  signal 
conditioning  unit  is  to  convert  each  induced  voltage  into  two  basic  measurements  (amplitude  and  phase  shift) 
which  are  then  transferred  to  the  host  computer  by  the  data  acquisition  circuitry. 

The  adopted  excitation  pattern  implies  that  each  coil  in  turn  will  be  energised  with  a  100  kHz  sinusoidal 
electrical  current  while  all  the  others  will  be  sensing  the  generated  magnetic  flux  at  the  border  of  the  region  of 
interest.  This  constitutes  a  projection.  At  the  end  of  a  complete  scan,  16  projections  with  15  measurements  each 
are  obtained,  yielding  a  total  of  240  independent  measurements.  This  is  carried  out  in  about  10  ms. 

The  host  computer,  a  PC,  besides  acting  as  a  front-end  to  the  user,  controls  the  sensing  set  up,  runs  the 
tomographic  algorithms  and  displays  the  reconstructed  images. 


Figure  2:  EMT  system’s  general  perspective. 


SCANNER 


Figure  3;  Block  diagr^^m  of  the  EMT  system. 


4  Design  Validation 


A  computer  simulation  of  the  scanner  was  carried  out  to  check  its  performance  and  to  provide  data  to  help  the 

design  of  future  enhanced  EMT  scanners. 

The  analytic  solution  of  field  equations  for  the  scanner  geometry  with  and  without  the  presence  of  a  sample 
object  was  not  yet  developed  due  to  its  complexity,  although  some  results  for  a  related  geometry  (parallel  excitation 
field)  have  already  been  reported  [Albrechtsen  95,  Xiong  95).  o-  ,  . 

The  simulation  model  was  input  into  an  electromagnetic  simulator,  the  Ansoft  Maxwell  2D  Simulator,  version 
1.3.  The  simulation  output  was  the  flux  linkage  computed  by  the  Eddy  Current  Field  Solver  [Ansoft  94]  at  the 

location  of  each  detection  coil.  „  .  .  .  ^ 

The  region  of  interest  was  sampled  at  97  different  locations,  as  shown  in  figure  4.  Sensitivity  maps  were  obtained 
by  successively  placing  a  10  mm  sample  rod  at  those  locations.  For  the  simulation,  only  55  positions  are  needed 
due  to  the  symmetrical  properties  of  the  geometry  used.  Three  different  materials  were  considered  (aluminium, 
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ferrite  and  soft  iron) 


Figure  4:  Locations  at  which  the  region  of  interest  was  sampled. 

Figure  5  shows  the  sensitivity  maps  of  the  scanner  at  coil  number  7  for  both  simulated  and  real  data.  As  it  can 
be  seen,  the  two  maps  are  rather  close  even  though  one  results  from  a  2D  simulation  of  a  3D  structure.  The  reason 
for  that  stems  from  the  fact  that  the  diameter  of  the  coils  is  0.4  cm,  while  the  sample  rods  used  were  cylinders  3 
cm  height  placed  half  way  through  the  plane  that  contains  the  axis  of  the  coils.  The  noise  present  in  the  simulation 
data  sensitivitv  maps  results  from  the  re-meshing  which  takes  place  every  time  a  new  position  is  computed. 


5  Imaging  Algorithms 

5.1  Heuristic  Procedure 

Simple  image  reconstruction  algorithms  can  be  devised  for  EMT  systems  in  certain  situations  by  assuming  a 
priori  knowledge  of  the  object  being  imaged.  This  type  of  algorithms  solve  very  simplified  problems,  for  example, 
locating  the  position  of  a  single  target  object  of  known  geometry  and  material  in  the  region  of  interest  which  is 
otherwise  non-magnetic  and  non-conductive. 

The  heuristic  algorithm  being  applied  can  be  subdivided  into  four  steps: 

1.  Measurements  are  evaluated  to  decide  in  which  coarse  category  the  target  material  falls  in: 

•  non-conductive  and  ferromagnetic. 

•  highly  conductive  and  ferromagnetic. 

•  highly  conductive. 

This  information  is  used  in  the  following  steps  for  the  selection  of  proper  internal  parameters. 

2.  The  angular  coordinate  of  the  target  object  centre  in  a  polar  system  is  determined  by  selecting  the  projection 
with  the  highest  variation  for  the  detection  coil  just  opposite  to  the  excitation  one.  A  correction  is  then 
introduced  by  fitting  the  measurements  to  a  gaussian  curve. 

3.  The  radial  coordinate  of  the  target  object  centre  in  a  polar  system  is  determined  next  by  selecting  one  of  the 
orthogonal  projections  to  the  one  found  in  step  2  and  performing  linear  interpolation. 

4.  Having  found  the  target  centre  location  and  having  an  idea  about  the  target  material,  the  target  diameter  is 
finally  determined  also  by  linear  interpolation. 

Experiments  have  shown  that  it  is  possible  to  determine  the  (x.y)  position,  the  diameter  and  the  type  of 
material  of  a  sample  rod  with  good  results.  Figure  6  shows  the  plots  of  the  estimated  error  for  different  situations. 
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Figure  6:  Estimated  error  on  the  evaluation  of  the  diameter  of  a  sample  rod  (left)  and  estimated 
error  on  the  evaluation  of  the  distance  from  the  sample  rod  centre  to  the  middle  of  the  region  of 
interest  (right).  Different  materials  are  considered. 
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5.2  ART 


Inverting  equations  (1-2)  represents  a  strongly  ill-posed  problem.  Only  the  radial  component  of  the  magnetic 
flux  density  vector  B  is  known  and  even  that,  at  only  a  finite  set  of  points  at  the  boundary  of  the  region  of  interest. 
Thus,  even  if  noise  is  not  taken  into  account,  the  problem  will  be  underdetermined  for  the  general  case.  So  the 
best  approach  is  to  proceed  in  an  iterative  way. 

.4RT-type  (Algebraic  Reconstruction  Techniques)  form  a  class  of  iterative  algorithms  for  solving  large  systems 
of  equations,  based  on  a  quadratic  optimisation  criterion,  which  were  introduced  into  image  reconstruction  by 
Gordon,  Bender  and  Herman  in  1970  [Gordon  70],  A  good  overview  is  found  in  [Herman  80], 

In  matrix  notation,  the  system  of  equations  may  be  represented  in  the  following  way: 

p  =  A.f  ,  (3) 

p  =  (p^)  G  represents  the  measurement  vector,  f  =  (fi)  6  the  image  vector  and  A  =  (ojj)  6  3?^^  x  K'' 
the  projection  matrix,  which  accounts  for  the  physical  model  of  data  collection. 

The  algorithm  that  has  been  used  is  a  straightforward  implementation  of  Kaczmarz  projection  method  based 
on  a  least  squares  minimum  norm  criterion,  which  can  be  summarised  as  follows: 

Initialisation: 


is  arbitrary 


Each  step: 


llailP 

The  projection  matrix  was  computed  from  the  sensitivity  map  data  and  was  stored  in  memory  which  makes 
the  algorithm  rather  efficient.  In  a  66  MHz  486  PC,  five  iterations  take  about  3  seconds. 

However,  the  convergence  towards  a  reasonable  solution  is  not  assured.  The  projection  matrix  A  was  built 
assuming  linearity  which  is  not  true  since  the  field  interactions  are  quite  non-linear.  So  it  does  not  pay  to  proceed 
iterating  after  a  certain  point,  since  the  noise  in  the  image  will  rise.  The  rate  of  convergence  can  be  increased 
with  the  proper  ordering  of  the  equations,  the  choice  of  a  good  value  for  the  relaxation  factor  and  the  use  of  tricks 
[Herman  80]. 

Figure  7  shows  an  example  of  what  has  currently  been  achieved.  Spatial  resolution  is  not  very  good  and  the 
images  are  only  qualitative,  although  one  can  already  distinguish  between  highly  conductive  non-ferromagnetic 
objects  and  non-conductive  ferromagnetic  objects. 


ART  -  (10  Iters.  Lambda=0.9)  (amplitude) 


ART  -  (10  Iter,s.  Lambda=0.9)  (amplitude) 


Figure  7:  ART  reconstruction  of  three  2.5  cm  diameter  aluminium  rods  placed  at  the  corners  of  a 
2.5  cm  equilateral  triangle  located  at  the  centre  of  the  region  of  interest. 
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6  Conclusions 

This  paper  has  described  a  EMT  tomographic  system  able  to  image  conductive  and  ferromagnetic  materials. 
At  present,  this  system  is  only  laboratory  based  and  was  developed  primarily  to  assess  the  technique.  However, 
the  results  so  far  obtained  show  promise  at  its  application  to  process  monitoring  and  control. 

An  inherent  limitation  of  the  technique  has  to  do  with  the  signal  to  noise  ratio.  Magnetic  field  distortions  due 
to  the  presence  of  objects  in  the  region  of  interest  are  usually  very  small  when  compared  with  the  applied  inagnetic 
field.  This  means  that  signal  conditioning  electronics  has  to  be  designed  very  carefully  to  minimise  this  effect. 

On  the  other  hand,  reconstruction  algorithms  produce  presently  only  qualitative  images.  Further  investigation 
is  needed  to  develop  more  sophisticated  algorithms  which  will  separate  the  images  of  the  spatial  distributions  of 
the  electrical  conductivity  and  the  magnetic  permeability.  Thus,  turning  EMT  into  a  true  dual  modality  imaging 
technique. 
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Abstract 

In  this  paper,  the  use  of  wavelet-like  basis  functions  in  the  finite  element  analysis  of  elliptic 
problems  in  which  the  mesh  has  a  fine  discretization  in  one  region,  and  a  coarse  discretization 
over  the  remainder  of  the  region,  will  be  presented  Construction  of  the  wavelet-like  basis 
functions  for  this  type  of  discretization  will  be  discussed.  The  condition  numbers  of  the  resulting 
matrices,  along  with  the  number  of  steps  required  for  convergence  of  the  conjugate  gradient 
solution  will  be  presented.  Comparisons  will  be  made  with  results  obtained  from  a  finite  element 
algorithm  employing  traditional  basis  functions. 


Introduction 

A  wavelet  is  a  function  that  is  localized  in  both  the  spatial  and  frequency  domains,  ^d  is  such 
that  any  square  integrable  function  can  be  expressed  in  terms  of  translations  and  dilations  of  the 
wavelet.  The  properties  of  wavelets  give  them  numerous  applications  in  such  areas  as  sampling, 
time-frequency  analysis,  and  signal  compression.  Only  recently  have  wavelet  methods  been 
applied  in  areas  of  computational  electromagnetics.  The  use  of  wavelet  methods  in  integral 
equation  techniques,  in  which  the  unknown  response  is  represented  in  terms  of  shifted  and  dilated 
forms  of  the  wavelet,  has  been  examined  [1];  and  more  recently  wavelets  have  been  used  in  the 
finite  element  solution  of  partial  differential  equations  [2],  Gordon  has  discussed  the  use  of 
wavelet-like  basis  functions  in  the  finite  element  analysis  of  elliptic  problems  over  highly 
heterogeneous  domains  [3  ] . 

One  problem  of  the  finite  element  method  lies  in  the  solution  of  a  large  matrix  equation.  As 
the  number  of  basis  functions  is  increased,  the  dimension  of  the  system  matrix  increases;  and  as 
this  occurs,  the  condition  number  of  the  system  matrix  increases  as  well.  With  the  use  of  the 
usual  preconditioning  methods,  the  condition  number  of  the  system  matrix  will  increase  as 


number  will  go  to  infinity  as  - r ,  as  the  number  of  points  of  discretization  is  increased.  This 

;j|log(/j)| 

increase  in  condition  number  can  create  solution  problems  such  as  numerical  instabilities,  slow 
convergence,  or  even  lack  of  convergence.  With  the  use  of  wavelet-like  basis  functions  and 
simple  diagonal  preconditioning,  it  has  been  shown  that  the  finite  element  method  yields  a  system 
matrix  that  is  bounded  by  a  constant  as  the  number  of  basis  functions  is  increased  [4],  Thus,  the 
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use  of  wavelet-like  basis  functions  appears  to  have  significant  potential  in  the  solution  of  large 
matrix  equations  arising  fi-om  the  finite  element  method. 

Construction  of  the  Wavelet-like  Basis  Functions 

To  construct  the  wavelet-like  basis  functions  to  be  used  in  the  finite  element  algorithm,  the 
technique  described  in  [4  ]  is  employed.  This  is  a  multiresolution  analysis  and  proceeds  as  follows. 
An  appropriate  initial  discretization  of  the  problem  domain  is  chosen.  This  initial  discretization  is 
chosen  such  that  it  contains  only  a  few  nodes.  The  traditional  basis  functions  corresponding  to  the 
nodes  in  this  initial  discretization  are  orthonormalized  with  respect  to  each  other.  These 
orthonormalized  functions  are  the  first  of  the  wavelet-like  basis  functions.  New  nodes  are  then 
added  to  the  mesh  at  locations  that  divide  the  subsections  between  the  previous  nodes.  The 
traditional  basis  functions  corresponding  to  these  new  nodes  are  then  orthogonalized  against  the 
existing  wavelet-like  basis  functions.  Then,  the  resulting  functions  are  orthonormalized  with 
respect  to  each  other.  The  functions  that  follow  from  this  procedure  are  then  added  to  the  basis 
of  wavelet-like  functions.  To  further  refine  the  mesh  in  a  given  region,  new  nodes  are  added 
between  the  existing  nodes  only  in  the  region  of  interest.  The  traditional  basis  functions 
associated  with  these  new  nodes  are  orthogonalized  against  the  existing  wavelet-like  functions, 
and  the  resulting  functions  are  then  orthonormalized  with  respect  to  each  other.  The  number  of 
points  of  discretization  may  be  increased  by  repeating  this  procedure.  Once  the  desired 
discretization  is  obtained,  the  standard  finite  element  method  is  then  employed  using  the  wavelet¬ 
like  functions  which  have  been  constructed. 

Consider  the  elliptic  problem: 


-  V  ■  iaVu)  +  bu  =  g  onO. 
u  =  hondD. 

where  Q  is  the  problem  domain,  dD.  is  the  boundary  of  D,  <3  >  C  >  0,  and  ^»  >  0. 

As  an  illustration  of  these  ideas  consider  the  following  one  dimensional  problems; 

Case  A;  Q  is  the  line  segment  fi-om  x-0  to  x-1  and: 

_  jLOfor  0.25  <x  <0.5 
^  ^  [25.0  otherwise 

b(x)  =  0.0 
w(0)=10.0 
u(l)  =  25.0 
g-(x)  =  0.0 
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Case  B:  Cl  is  the  line  segment  from  x=0  to  x=l  and: 

a(x)  =  1.0 


bix)  =  3.0 
k(0)  =  5.0 
w(l)  =  15.0 
g(x)  =  xexp(x) 


With  the  above  conditions,  it  is  desired  to  have  more  basis  fiinctions  in  the  region  .25<x<.5, 
and  fewer  basis  functions  outside  this  region.  The  first  few  basis  functions  are  shown  in  figures  1- 
5.  The  initial  discretization  contains  a  single  node  located  at  the  midpoint  of  the  interval.  Figure 
1  shows  the  first  function  added  to  the  basis  corresponding  to  this  node.  Figures  2  and  3  show 
the  basis  functions  which  result  from  new  nodes  being  added  to  the  mesh.  At  this  pomt,  new 
nodes  are  only  added  in  the  region  .25<x<.5,  and  the  resulting  basis  functions  are  shown  m 
figures  4  and  5. 

Numerical  Results 

Case  A:  The  problem  above  was  solved  using  7,  9,  13,  21,  37,  69,  133,  261,  517,  and  1029 
wavelet-like  and  traditional  basis  functions.  Figure  6  shows  the  condition  number  versus  number 
of  basis  functions  when  wavelet-like  basis  functions  are  used.  The  corresponding  plot  for  the 
traditional  basis  functions  can  not  be  shown  on  the  same  graph  because  it  would  be  far  off  scale, 
thus  Figure  7  is  presented  to  show  these  results.  From  these  figures,  it  can  be  seen  that  the 
condition  number  is  576,513  when  1029  traditional  basis  functions  are  used,  and  that  when 
wavelet-like  basis  functions  are  used  the  condition  number  reaches  a  value  of  37.64  and  then 
decreases  as  the  number  of  basis  functions  is  increased.  Clearly,  the  preconditioned  ^obal  matrix 
arising  from  the  use  of  wavelet-like  functions  is  much  better  conditioned  than  that  arising  from  the 
use  of  traditional  basis  functions.  The  importance  of  this  is  demonstrated  in  figures  8  and  9.  In 
these  figures  we  see  plots  of  the  number  of  steps  required  for  the  convergence  of  the  conjugate 
gradient  algorithm  for  each  of  the  cases  presented  in  figures  6  and  7.  (The  conjugate  gradient 
algorithm  was  applied  using  the  preconditioned  global  matrix  for  which  the  condition  number  is 
plotted  in  figures  6  and  7.)  From  these  figures,  it  is  seen  that  when  wavelet-Uke  basis  functions 
are  used,  the  number  of  steps  remains  less  than  21  even  when  1029  basis  functions  are  used.  For 
the  same  number  of  traditional  basis  functions,  the  number  of  steps  required  is  1029.  From 
Figure  8  it  can  bee  seen  that  when  wavelet-like  basis  functions  are  used,  the  number  of  steps 
required  for  a  matrix  of  order  1029  is  only  5.26%  greater  than  the  number  of  steps  required  for  a 
matrix  of  order  69,  and  only  25%  greater  than  that  required  for  a  matrix  of  order  21. 

Case  B:  This  problem  was  also  solved  using  7,  9,  13,  21,  37,  69,  133,  261,  517,  and  1029 
wavelet-like  and  traditional  basis  functions.  Figures  10  and  1 1  show  the  condition  number  versus 
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number  of  basis  functions,  when  wavelet-like  and  traditional  basis  functions  are  used.  In  this  case, 
the  eflfect  on  the  condition  number  is  more  dramatic.  When  traditional  basis  functions  are  used, 
the  condition  number  climbs  to  more  than  2.8  million  for  1029  basis  functions.  For  the  same 
number  of  wavelet-like  basis  functions,  the  condition  number  remains  less  than  17.  Figures  12 
and  13  show  the  number  of  steps  required  for  convergence  of  the  conjugate  gradient  algorithm  for 
the  preconditioned  matrices  arising  from  the  use  of  wavelet-like  and  traditional  basis  functions. 
Again,  these  figures  show  that  the  performance  obtained  using  the  wavelet-like  basis  functions  is 
much  better  than  that  of  the  traditional  basis  fimctions.  From  Figure  12,  we  can  see  that  the 
number  of  steps  required  for  a  matrix  of  order  1029  is  7.69%  greater  than  that  for  a  matrix  of 
order  69,  and  only  27.2%  greater  than  the  number  of  steps  required  for  a  matrix  of  order  21. 

Conclusions 

In  this  paper,  the  use  of  wavelet-Uke  basis  functions  in  the  solution  of  elliptic  problems  over 
one- dimensional  domains  is  discussed.  In  this  case,  the  use  of  such  basis  functions  yields  matrices 
that,  with  simple  diagonal  preconditioning,  are  much  better  conditioned  than  are  those  resulting 
from  the  use  of  traditional  finite  element  basis  functions.  This  leads  to  much  more  rapid 
convergence  of  the  conjugate  gradient  algorithm  when  these  wavelet-like  basis  fimctions  are 
employed. 
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ABSTRACT 

Scientific  and  engineering  research  and  design  codes  are  typically  FORTRAN-based, 
different  data  structures  and  I/O,  and  unique  and  inconsistent  user  interfaces  resulting  in  long  leading 
ojrves  and  difficulties  in  usage.  This  is  true  of  the  codes  implenrenting  the  f 

electromagnetics  (CEM)  and  thermomechanics  (TM)  formulations.  Manual  manipulation  of  data  from 
code  to  code  is  required  to  ensure  validity  of  the  input.  Coupled  with  the  difficulty  of  graphically  vowing 
the  geometry  and  translating  it  to  an  analysis  model,  the  r^ult  is  numerous 
introduction  of  human  errors  into  the  design  process.  This  situation  also  adversely  ^ 

(human  and  software/hardware)  of  maintaining  and  extending  design  capabilities  as  the  theoretical 
foundation  and  design  needs  advance.  The  Research  and  Engineenng  Framework  (REF)  P/ovides  a 
coherent,  heterogeneous  design  environment  which  is  toe  first  step  toward 
problems.  Although  the  REF  was  initially  developed  to  address  the  problerns  of  the  U.S.  power  tube 
industry,  the  underlying  framework  Is  open;  It  can  accommodate  a  broad  range  of  saentifi^c  and 
engineering  disciplini.  including  CEM  modeling  and  simulation.  The  REF  focuses  on  providing  foab  ing 
technology  in  several  critical  areas  including:  ease  of  use  and  consistency,  support  for  a  fpjster 
qeometry;  data  sharing;  integration  of  codes  at  different  dimensionalities,  heterogeneous  networtang, 
numerical  grid  generation;  visual  connection  of  tools  and  data  flows;  and  configuration  management  of 
design  data  for  project  teams. 

During  the  past  year,  the  REF  Development  Team  has  made  several  enhancements  to  the  framework 
including  a  new  control  panel,  a  visualization  suite,  a  programmatic  interface  for  sharing  solution  data, 
standards  for  the  exchange  of  mesh  data,  enhanced  code  wrappering  teginiques,  a 
management  system,  and  data  dictionary  management  software  flowing  REF  users  to  more  readily 
insert  discipline  specific  dictionaries  (such  as  that  required  for  CEM)  into  the  environment. 


This  paper  expands  on  those  given  at  earlier  ACES  conferences’^  and  on  art  article  in  the  ACES 
Newsletter’®’  to  provide  an  updated  view  of  the  capabilities  of  the  Research  and  Engineenng  Framework. 
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BACKGROUND 


The  MMACE  program  has  been  underway  for  over  four  years  and  is  now  in  its  second  phase.  The  first 
phase,  which  began  in  1992,  was  an  investigation  phase.  Four  teams  were  put  in  place  to  study  the 
problems  of  the  Microwave  Power  Tube  industry  and  develop  roadm^s  for  the  implementation  of  a 
framework  to  address  those  problems.  The  teams  found  an  existing  design  environment  that:  was  based 
largely  on  in-house  developed  software:  was  largely  unsupported  by  commercial  vendors;  did  not  take 
advantage  of  the  latest  advances  in  CAD,  visualization,  database  technology,  and  networking;  was 
addressing  technically  difficult  problems;  contained  a  collection  of  tools  that  were  not  well  integrated  with 
each  other;  and  was  crucial  to  projects  deemed  important  to  the  US  government.  While  these  items  were 
true  of  the  power  tube  industry,  the  same  can  be  said  of  many  niche  scientific  disciplines  including  CEM. 
Once  the  investigation  was  complete,  the  four  MMACE  teams  began  constructing  a  prototype  system  to 
experiment  with  some  of  the  proposed  solutions.  This  prototype  was  released  in  early  1994. 

Phase  II  of  the  MMACE  program,  scheduled  to  last  three  years,  began  in  March  of  1994.  Raytheon 
Company  and  Compact  Software,  along  with  several  subcontracto^s^  are  under  contract  to  jointly 
develop  the  complete  framework.  However,  it  became  obvious  that  the  problem  should  be  split  into  two 
parts.  The  first  part  of  the  problem  addresses  those  areas  specific  to  power  tube  design.  The  second 
part  of  the  problem  addresses  areas  that  are  common  to  multiple  design  disciplines  and  therefore  have 
far  greater  applicability.  The  portion  of  the  system  common  to  multiple  disciplines,  including  CEM,  is 
referred  to  as  REF,  the  Research  and  Engineering  Framework.  Version  2.0  of  the  REF  was  shipped  in 
January  of  1996  and  is  now  available  for  both  the  Hewlett-Packard  and  Sun  platforms.  Source  code  is 
available  for  those  wishing  to  port  the  system  to  additional  platforms. 

OVERVIEW 

The  REF  contains  programming  interfaces,  standards,  and  tools  to  aid  in  the  integration  of  codes.  In 
addition,  it  provides  the  data  management,  networking,  visualization,  and  "Master"  geometry  needed  to 
drive  analysis  codes.  The  REF  also  provides  the  utilities  needed  to  construct  graphical  user  interfaces 
irrcluding  the  top  level  control  panel.  The  REF  portion  of  the  system  can  be  compared  to  a  computer 
operating  system:  generic,  powerful,  and  providing  much  functionality  but  of  little  interest  to  the  end  user 
who  requires  software  that  is  written  to  operate  on  top  of  it.  Rgure  1  shows  the  structure  and  major 
components  of  the  REF. 


Figure  1  -  REF  Architecture 


^  The  Compact/Raytheon  Team  includes  Sabbagh  Associates.  Hughes,  Litton,  Triton  Services, 
Teledyne,  CPI  and  Northrop  Grumman  as  subcontractors. 
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On  top  of  the  REF,  one  can  layer  design  tools  specific  to  a  given  design  discipline,  such 

MMACE  program,  we  have  tested  the  REF  by  integrating  a  collection  of  design  tools  used  in  the  des  gn 

and  anal^is  of  power  tubes  and  the  data  dictionary  describing  the  data 

and  analysis.  These  design  and  analysis  codes  can  be  compared  to  off  the  shelf 

loaded  on  a  system  and  accessed  by  the  end  users  to  perform  specific  functions^  However,  like  off  the 

shelf  software,  without  the  underlying  operating  system  (in  this  rase  the  REH 

useless.  Figure  2  shows  the  REF  with  discipline  specific  tools,  databases,  and  data  d'cftonaries 

connected  to  provide  ttie  user  with  a  complete  environment.  Although  the  sfwafic  codes  ^® 

Integrated  for  CEM  would  be  different  from  those  that  have  been  integrated  to  date,  the  concept  is  the 

same. 
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Figure  2  -  REF  Environment  Customiied  for  a  Specific  Design  Discipline 
RESEARCH  AND  ENGINEERING  FRAMEWORK  (REF)  DETAIL 

While  Phase  I  of  the  MMACE  program  concentrated  on  developing  design  requirements  ^d 
implementation  of  a  prototype  framework,  Phase  II  focuses  on  enhancing,  extending,  and  developing 
framework  technology  in  eight  key  areas: 


-  Ease  of  use  and  consistency 

-  Sharing  of  common  design  data 

-  “Master"  geometry  driven  analysis 

-  Standard  foundation  for  numerical  grid  generation 

-  Integration  of  tools  at  different  dimensionalities 

-  Visual  connection  of  tools  and  data  flows 

-  Use,  configuration  &  management  of  design  data 

-  Transparent  support  for  network  communications  &  distributed  processing 

During  the  year  that  has  past  since  the  last  ACES  conference,  the  MMACE  program  has  made  great 
progress  in  several  of  these  key  areas. 
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ENHANCED  CONTROL  PANEL 


During  the  past  year,  the  control  panel  for  the  system,  shown  in  figure  3,  has  been  completely 
redesigned.  The  MMACE  Control  Panel  is  the  first  thing  the  user  sees  when  accessing  the  Framework. 
While  the  REF  provided  the  underlying  tools  for  creating  the  control  panel,  the  specific  button  layout  and 
dependencies  that  are  shown  are  specific  to  power  tube  design.  Obviously,  ttie  buttons  and  layout  would 
be  different  for  CEM.  The  control  panel  contains  buttons  for  each  of  the  codes  available  to  the  user. 
These  buttons  are  laid  out  in  the  process  flow  appropriate  to  a  given  project.  Each  design  could  have  a 
different  process.  These  tool  flows  can  be  saved  for  use  on  ofrier  projects  and  can  also  be  shared 
between  users  on  the  same  project.  The  user  simply  executes  codes  by  pushing  buttons  available  on  the 
OSF/Motif  compliant  interface.  The  control  panel  has  been  designed  in  such  a  way  to  make  it  possible  to 
“plug”  in  new  codes  as  needed.  It  automatically  checks  to  make  sure  that  a  code  has  the  data  necessary 
to  execute  and  triggers  the  code’s  wrapper,  if  necessary,  to  create  all  needed  input  files.  The  new  control 
panel  also  includes  logic  to  automatically  run  parametric  studies.  The  control  panel  will  execute  the  code 
over  a  range  of  database  values  saving  the  results  for  later  analysis. 


Figure  3  -  Enhanced  Control  Panel 

TTie  control  panel  can  be  configured  to  run  through  a  series  of  analysis  codes  without  user  intervention. 
This  is  especially  useful  for  codes  that  require  several  hours  to  complete.  The  user  can  simply  start  the 
first  code  and  go  home.  The  control  panel  will  continue  executing  down  stream  codes  until  the  sequence 
is  completed  or  an  error  occurs. 
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CEM  design  and  analysis  tools  can  easily  be  integrated  into  the  system  using  the  wrappering  tools 
included  with  the  REF.  Once  wrappered,  the  user  can  interactively  add  a  button  on  the  control  panel  to 
access  that  code. 


Another  major  enhancement  made  to  the  system  over  the  past  year  is  the  inclusion  of  softw^e  to  allow 
users  to  easily  view  and  modify  the  data  dictionary.  The  data  dictionary  can  be  thought  of  as  the  index  to 
the  data  entities  known  by  the  framework.  Each  design  discipline,  and  possibly  each  company,  might 
have  its  own  data  dictionary.  This  software,  shown  in  figure  4,  provides  an  easy  way  to  search  and  edit 
the  dictionary.  Code  developers  can  use  the  system  to  find  the  database  name  of  the  parameter  n^ded 
for  their  code’s  input.  Likewise,  the  code’s  end  user  can  use  the  system  to  find  tiie  descriptiori,  default 
value,  and  constraints  of  a  given  field  needed  by  the  code.  The  Data  Dictionary  software  also  allows  the 
code  developer  to  extend  the  dictionary  as  needed  to  allow  for  additioiial  data  entities  to  be  included  in  an 
analysis.  The  software  automatically  keeps  track  of  changes  to  the  diction^  so  that  the  user  can  easily 
distinguish  “approved”  dictionary  information  from  later  modifications  to  frie  dictionary. 


Figure  4  -  Date  Dictionary  Support  Software 


NUMERICAL  GRID  GENERATION 


A  key  feature  of  the  REF  environment  is  the  support  for  numericaf  grid  generation.  In  the  current  design 
environment,  gridding  software  is  often  tied  to  a  particular  code  and  Is  sometimes  embedded  within  that 
code.  The  objective  of  the  MMACE  program  is  to  develop  standards  for  the  exchange  of  grid  information 
and,  in  doing  so,  allow  multiple  codes  to  share  grid  information.  In  addition,  by  creating  this  standard 
layer,  users  are  able  to  plug-in  the  latest  gridding  software  as  it  becomes  available.  To  test  this,  an 
existing  grid  generator,  Ingrid,  has  been  adapted  to  use  the  REF  gridding  standard.  The  grids  can  then 
be  saved  and  later  accessed  by  both  electromagnetic  and  thermomechanical  codes.  An  example  of  the 
type  of  grids  supported  is  shown  in  figure  5. 

The  REF  grid  generation  requirements  for  both  thermomechanical  (TM)  and  computational 
electromagnetic  (CEM)  analysis  can  be  briefly  summarized  as  follows: 

•  Mixture  of  line,  surface  and  solid  elements  -  node-based  geometric  entities  in  TM,  and  both 
node  and  edge-based  in  CEM. 

•  Steep  gradient  of  grid  density  to  accurately  model  localized  phenomena. 

•  P-type  and  H-type  grid  refinements  in  adaptive  gridding 

•  Specification  of  boundary  codes,  loadings,  initial  conditions  and  material  properties  (e.g., 
density,  thermal  and  electrical  conductivities,  Young's  modulus,  Poisson's  ratio,  permeability, 
dielectric  permitivity,  etc.) 

•  Sliding  interfaces  and  joints  in  TM,  and  surfaces  of  discontinuities  in  CEM. 

•  Symmetry  (or  anti-symmetry)  planes 

•  Computed  quantities  for  post  processing  -  for  TM:  temperature,  displacements,  velocities, 
accelerations,  stresses,  bending  moments,  material-dependent  variables,  etc.;  for  CEM: 
electric  and  magnetic  fields,  current  density,  scalar  and  vector  potentials,  etc.  These  may 
involve  complex  numbers. 

•  Rectangular  grids  used  in  the  finite-difference  and  volume-integral  methods  in  CEM. 


Figure  5  Gridding 

These  requirements  were  used  as  input  for  the  design  and  development  of  the  REF  gridding  interface,  a 
FORTRAN  library  for  the  storage  and  retrieval  of  grid  information. 
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nnnfiauratton  and  Management  ot  uesiqn  uaia 

Proper  management  of  design  data  is  critical  to  producing  a  quality  product.  Typically,  desi^gn  data  has 
been  maintained  by  the  design  engineer  in  hardcopy  format.  Data  is  often  stored  in  ^veral  filing  cabinets 
and  in  other  places  throughout  the  organization,  it  is  often  difficult  to  find  the  latest  version  of  frie  data, 
especially  if  people  involved  in  the  original  design  are  no  longer  around. 

A  configuration  and  management  system  was  developed  during  the  past  y^r  so  that  ^ 
automahcally  track  design  data.  The  system,  shown  in  figure  6,  keeps  f  ^onic  copi^  o  d^erent 
versions  of  the  data,  both  for  designs  under  development  and  for  products  that  have  shipped.  Users  are 
able  to  instantly  retrieve  data  from  any  version  of  the  product.  In  addition,  by  automatically  k^ping  track 
of  the  data,  users  are  able  to  explore  different  design  variations  without  having  to  worry  about  overwnting 
infonnation  from  alternate  design  options. 

Once  a  design  is  complete,  data  for  alternate  design  options  can  be  deleted  to  ojnserve  space  and  the 
final  database  can  be  archived  in  an  electronic  vault  for  future  reference.  Often,  this  archived  version  will 
be  used  as  the  starting  point  for  upgrades  or  even  for  new  designs  that  are  similar  to  the  one  archived.  A 
browser,  included  with  the  CM  system,  makes  it  easy  to  locate  archived  designs. 


Figure  6  -  Configuration  Management 

The  REF  design  data  management  system  is  intended  to  be  an  adjunct  to,  not  a  repla<^ment  for,  more 
capable  and  full-featured  corporate  data  management  systems.  The  focus  of  REF  data  administration 
capabilities  is  to  provide  for  the  support  of  small  R&D  groups  in  the  configuration  and  management  of 
their  technical  data. 


ADDITIONAL  FNHANCED  FUNCTIONALITY 


In  addition  to  the  portions  of  the  framework  described  above,  the  MMACE  program  has  also  ma^ 
significant  advances  during  the  past  year  in  the  areas  of  data  visualization,  code  wr^penng,  and  the 
exchange  of  solution  data. 
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In  the  area  of  visualization,  a  suite  of  visualization  tools  has  been  integrated  which  provide  a  wide  range 
of  visualization  capabilities  including  animation^.  By  utilizing  these  tools,  code  developers  can  focus  their 
time  on  developing  the  physics  portions  of  the  code  rather  than  on  data  presentation. 

In  the  area  of  code  wrappering,  the  existing  utilities  were  extended  to  allow  a  much  wider  range  of  codes 
to  be  easily  inserted  into  the  framework.  The  wrappering  software  has  also  been  extended  to  allow  users 
to  perform  unit  conversions  and  data  formulations  within  the  code  wrapper. 

For  solution  data,  a  standard  interface  was  developed  to  allow  codes  to  store  and  exchange  finite  element 
and  finite  difference  solution  data.  This  is  extremely  useful,  especially  when  used  in  conjunction  with  the 
gridding  interface. 

SUMMARY 

Phase  tl  of  the  MMACE  program  is  providing  a  full  research  and  engineering  framework  for  specialized 
technical  communities  which  will  approach  the  functionality,  quality,  and  robustness  found  In  commercial 
products.  The  REF  core  development  is  focusing  on  some  of  the  problems  not  addressed  adequately  by 
commercial  vendors  because  of  the  tvpically  small  market  size  of  engineering  and  scientific  communities. 
Specifically,  MMACE  and  its  underlying  REF  core  will  provide  a  low-cost  environment  that  is  easy  to  use, 
has  a  consistent  interface,  promotes  data  sharing,  provides  a  unified  source  of  geometry,  supplies 
standards  for  numerical  grid  generation,  integrates  1D,  2D,  and  3D  codes,  provides  a  visual  means  for 
the  user  to  view  data  and  data  flows,  performs  configuration  management  of  the  design  data,  and 
supports  the  use  of  heterogeneous  networks  of  computers.  The  program  has  made  significant  progress 
toward  meeting  these  objectives. 

The  REF  is  providing  the  foundation  for  design  environments  that  are  applicable  to  a  wide  variety  of 
technical  design  disciplines  including,  power  tube  design,  antenna  design,  and  computational 
electromagnetics  among  others.  The  REF  software  is  distributed  at  no  charge®  and  the  developers  are 
eager  to  work  with  anyone  who  desires  to  make  use  of  the  software. 
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Abstract 

A  body  of  revolution  finite  difference  time  domain  formulation  has  been  developed  for  obtaining  the 
broadband  (30MHz  to  6GHz)  reflection  coefficient  of  dispersive  liquids  in  a  short-circuited  coaxial  cavity 
fed  by  a  coaxial  transmission  line.  The  dispersive  behavior  is  described  by  a  convolution  integral.  The 
code  solves  the  discretised  convolution  integral  by  recursion.  The  predictions  of  the  code  for  the  reflection 
coefficient  of  a  transition  from  Teflon  to  distilled  water  are  compared,  in  the  frequency  domain,  both  to 
data  measured  by  an  HP  8753C  network  analyzer  and  to  the  numerical  predictions  of  a  transmission  line 
equivalent  circuit  model.  A  second  feature  of  the  code  handles  a  step  discontinuity  in  a  coaxial  system 
with  a  non-dispersive  dielectric  material.  It  is  shown  how  this  feature  was  used  to  optimize  the  stepped 
transition  in  the  coaxial  measuring  system.  The  results  of  the  FDTD  code  show  good  agreement  with  the 
experimentally  determined  optimum  design. 


1  Introduction 

Knowledge  of  the  dielectric  properties  of  materials  and  their  fretiuency  de]3endent  behaviour  at  radio  and  mi¬ 
crowave  frequencies  is  important  both  in  fundamental  and  applied  re.search.  The  measurement  of  reflections 
from  samples  placed  in  coaxial  lines  is  a  well-known  technique  for  determining  the  dielectric  properties  of 
materials. 

This  paper  describes  a  body  of  revolution  finite  difference  time  domain  (BOR-FDTD)  formulation,  based 
on  the  work  of  Davidson  and  Ziolkowski  [1],  which  has  been  developed  for  obtaining  the  broadband  (30MHz 
to  6GHz)  reflection  coefficient  of  disiiersive  liquids  in  a  short-circuited  coaxial  cavity  fed  by  a  coaxial  trans¬ 
mission  line. 

The  resultant  code  solves  for  the  time  evolution  of  all  six  field  components  inside  a  coaxial  system.  The 
code  has  two  features.  The  first  simulates  the  time  evolution  of  the  field  components  in  a  coaxial  line  with  a 
non-dispersive  dielectric  material  and  allows  for  step  discontinuities  in  both  the  inner  and  outer  coaxial  con¬ 
ductors.  It  is  shown  how  this  feature  was  used  to  optimize  the  design  of  a  sample  holder  for  liquid  dielectrics. 

The  second  feature  simulates  the  time  evolution  of  the  field  components  in  a  coaxial  line  which  contains 
a  transition  from  a  non-dispersive  to  a  dispersive  material.  The  predictions  of  the  code  for  the  reflection 
coefficient  of  a  transition  from  Teflon  to  distilled  water  are  compared,  in  tlie  frequency  domain,  both  to  data 
measured  by  an  HP  8753C  network  analyzer  and  to  the  numerical  predictions  of  a  transmission  line  equivalent 
circuit  model. 
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Figure  1:  Cross-section  of  the  designed  measuring  system. 


2  Body  of  revolution  finite  difference  time  domain  formulation 

The  measurement  system  for  measuring  the  reflections  from  a  dispersive  liquid  sample  in  a  short-circuited 
coaxial  cavity  with  a  network  analyser  is  shown  in  Figure  1.  The  cavity  is  fed  by  a  coaxial  connector  and 
terminated  in  a  removable  short  circuit.  The  dimensions  of  the  cavity  result  in  a  50fi  system  in  Teflon.  The 
transition  from  the  connector  to  the  cavity  occurs  in  a  region  filled  with  Teflon. 

A  body  of  revolution  finite  difference  time  domain  (BOR-FDTD)  formulation  was  developed  to  model 
the  measuring  system.  The  BOR-FDTD  formulation  solves  the  time-dependent  Maxwell  curl  equations  nu¬ 
merically.  The  rotational  symmetry  in  the  geometry  is  exploited  using  a  Fourier  modal  expansion.  Expanding 
the  azimuthal  field  dependence  as  a  Fourier  series,  the  problem  space  is  effectively  reduced  to  two  dimensions, 
but  the  solution  is  three  dimensional;  the  code  solves  for  the  full-wave  vector  Maxwell  equations.  The  code 
was  developed  to  simulate  the  TEM  wave  propagation  inside  a  coaxial  transmission  line  system. 

The  code  handles  two  specific  discontinuities  in  a  coaxial  transmission  line.  The  first  application  simulates 
the  dme  evolution  of  the  field  components  in  a  non-dispersive  dielectric  material  and  includes  step  disconti¬ 
nuities  in  the  coaxial  geometry.  This  feature  was  developed  in  order  to  investigate  the  transition  from  coaxial 
connector  to  coaxial  cavity.  The  code  is  based  on  tabulated  formulae  in  [1,  2]. 

The  second  application  simulates  the  time  evolution  of  the  field  components  in  a  coaxial  line  with  a  dis¬ 
continuity  in  the  dielectric.  The  dielectric  change  may  involve  a  dispersive  dielectric  material.  The  finite 
difference  equations  of  Maxwell’s  equations  for  dispersive  materials  were  derived  for  the  body  of  revolution 
formulation  [3].  The  dispersive  materials  are  handled  by  the  discretised  convolution  integral  with  recursion,  as 
suggested  by  [4].  The  dispersive  dielectric  is  modelled  by  a  time-dependent  susceptibility  function  with  con¬ 
stant  parameters.  Only  the  Debye  model  for  the  permittivity  was  implemented  in  the  code  (Appendix  A).  The 
Lorentz  and  Drude  models  for  permittivity  can  easily  be  added  to  the  code,  using  the  same  recursion  formulae. 

The  excitation  pulse  has  the  1/p  spatial  distribution  of  the  TEM  (k=0)  mode.  The  time-history  is  Gaussian. 
The  source  treatment  is  a  total  field/scattered  field  treatment  at  the  source  plane.  The  details  may  be  found 
in  [1,  §3.2].  It  simulates  a  plane  wave  with  1/p  spatial  weighting  in  a  homogeneous  non-dispersive  dielectric 
space.  The  fields  on  and  above  this  plane  are  assumed  to  be  total  fields  (i.e.  incident  and  reflected  fields); 
the  fields  below  this  plane  are  reflected  (scattered)  fields. 

The  r  =  0  plane  is  treated  using  a  one-dimensional  absorbing  boundary  condition  for  the  Eta„  field  compo- 
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Figure  2:  Geometry  of  the  coaxial  cavity  sys-  Figure  3:  Equivalent  geometry  of  the  coaxial 

tern  with  a  stepped  discontinuity.  cavity  system  with  a  dielectric  discontinuity. 

nents'  the  plane  can  either  similarly  be  treated  ;is  an  absorbing  boundary  condition  or  as  a  perfect 

electric  conductor,  depending  on  the  load.  The  constant  p  boundary  planes  (the  inner  and  outer  conductors 
of  the  coaxial  transmission  line)  are  treated  a-s  jierfect  electric  conductors. 


3  Modelling  of  a  stepped  disontinuity  in  a  coaxial  transmission 
line 


The  transition  from  the  feeding  connector  to  the  coaxial  cavity  involves  a  stepjied  discontinuity  in  the  coaxial 
geometry  (Figure  1).  Changing  the  value  of  S,  changes  the  amount  of  reflection  that  will  occur  from  the 
transition.  The  discontinuity  was  investigated  numerically  and  experimentally  to  optimize  the  design  of  the 
sample  holder. 

The  geometry  for  the  BOR-FDTD  analysis  is  shown  in  Figure  2.  It  consists  of  the  perfectly  conducUng 
areas  of  the  inner  and  outer  conductors  surrounding  a  region  filled  with  a  dielectric.  The  physical  dimensions 
shown  in  Figure  1  for  the  coaxial  cross-sections  are  used.  The  coaxial  line  is  filled  with  Teflon  with  £r  -  2.0. 

The  time-history  of  the  field  components  at  a  2-plane  before  the  step  discontinuity  is  stored  for  output. 
This  plane  is  selected  far  enough  from  the  discontinuity  so  that  only  the  TEM  (to  z)  field  components  need 
to  be  considered.  The  reflection  coefficient  at  this  plane  is  calculated  by  taking  the  Fourier  transform  of  the 
reflected  wave  and  dividing  it  by  the  Fourier  transform  of  the  incident  wave. 

Simulations  were  run  for  several  lengths  of  6  and  the  Fourier  transform  of  the  solution  was  computed 
to  obtain  the  frequency  dependent  reflection  coefficient  caused  by  the  transition  (Figure  4).  The  reflections 
from  the  transition  between  the  coaxial  line  and  the  coaxial  cavity  were  measured  with  an  HP  8753C  network 
analyser.  The  offset  S  was  changed  systematically  to  find  the  optimum  value  for  the  least  reflection. 

The  BOR-FDTD  analysis  of  the  coaxial  transmission  line  system  with  a  stepped  discontinuity  was  found 
to  be  consistent  with  measurements.  The  BOR-FDTD  analysis  predicted  an  optimum  value  of  S  =  1.7mm 
with  a  reflection  coefficient  between  -35dB  and  -34dB  over  the  6GHz  frequency  band.  This  corresponds  well 
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Figure  4:  BOR-FDTD  simulation:  reflection  coefficient  for  different  values  of  S. 


with  the  experimentally  determined  value  of  S  ^  1.6mm  which  yielded  a  reflection  coefficient  between  -40dB 
and  -33dB  over  the  30MH2  to  6GHz  band. 


4  Modelling  of  a  dispersive  material  in  a  coaxial  transmission  line 


The  BOR-FDTD  formulation  for  the  analysis  of  the  measurement  system,  consisting  of  a  short-circuited 
coaxial  cavity  filled  with  a  dispersive  material  and  driven  by  a  coaxial  transmission  line,  uses  an  equivalent 
geometry.  The  measuring  system  is  modelled  as  a  50Q  transmission  line  (with  Teflon  dielectric)  feeding  a 
transmission  line  containing  the  liquid  dielectric  (Figure  3).  Due  to  the  low  reflections  from  the  stepped 
discontinuity,  the  transition  is  simply  modelled  as  a  oOQ  transmission  line.  The  distance  from  the  calibration 
(observation)  plane  to  the  Teflon  bead/cavity  interface  was  determined  experimentally  from  a  time  domain 
measurement  on  the  network  analyser. 

The  predictions  of  the  code  for  the  reflection  coefficient  of  a  transition  from  Teflon  to  distilled  water  are 
compared,  in  the  frequency  domain,  both  to  data  measured  by  an  HP  8753C  network  analyzer  and  to  the 
numerical  predictions  of  a  transmission  line  equivalent  circuit  model. 

The  water  is  modelled  as  a  Debye  dispersive  medium,  with  typical  parameters  (dependent  on  pressure  and 
temperature)  £,  =  81,  =  1.8  and  r,  =  9.4ps  [4].  The  parameters  are  supplied  to  the  code  and  the 

BOR-FDTD  equations  are  updated  at  each  time  step  using  the  discretised  convolution  integral  equation  and 
recursion  (Appendix  A).  The  same  parameters  were  used  in  the  BOR-FDTD  analysis  and  the  transmission 
line  analysis. 

Although  the  Teflon  bead  that  shields  the  connector  and  transition  region  from  the  liquid  in  the  cavity  fits 
tightly  into  the  cavity,  water  leaked  down  the  sides  of  the  conductor  surfaces  into  the  transition  region.  This 
had  a  significant  influence  on  the  measured  reflection  coefficient.  Silicon  gel  was  applied  to  the  surfaces  of  the 


394 


0 


Figure  5:  Magnitude  of  the  reflection  coefficient  with  purified  water 
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Fi^re  6:  Phase  of  the  reflection  coefficient  ».th  purified  water  inside  the  caeily.  A 
results  of  the  BOR-FDTD  analysis,  transmission  line  analysis  and  measurement,  arame 
£,  =  81,  £oo  =  1-8  and  r,  =  9.4ps. 


Teflon  bead  to  prevent  water  from  entering  the  transition  region. 

After  the  experimental  diflEculties  were  eliminated,  very  good  results  were  obtained  over  the  entire  band 
from  30MHz  to  6GHz.  The  results  for  the  reflection  coefficient  are  shown  in  Figure  5  for  the  magnitude  and 
in  Figure  6  for  the  phase.  The  comparison  between  the  BOR-FDTD  analysis,  the  measured  magnitude  of 
the  reflection  coefficient  and  the  transmission  line  analysis  is  good.  The  results  of  the  analyses  are  in  exact 
agreement  for  frequencies  above  2GHz.  Below  2GHz  the  agreement  is  good,  although  not  exact.  The  BOR- 
FDTD  and  transmission  line  results  for  the  phase  of  the  reflection  coefficient  are  in  excellent  agreement  over 
the  frequency  range.  The  phase  shift  in  the  high  frequency  (2.5  to  6GHz)  result  of  the  measured  phase  can 
be  attributed  to  the  effect  of  the  silicon  gel. 

The  rapid  changes  in  the  reflection  coefficient  require  a  high  re.solution  in  the  frequency  data  of  the  anal¬ 
yses.  A  stability  and  phase  error  analysis  should  be  conducted  to  determine  the  stability  restriction  for  this 
system  of  partial  differential  equations  and  discretised  convolution  integral.  For  accuracy  the  discretization 
has  to  adequately  sample  the  shortest  timescale  in  the  iiroiilem  [5].  The  relaxation  timescale  for  water  was 
taken  as  9.4ps  while  the  timestep  size  was  0..37ps  (i.e.  At  =  Finer  timestepping  should  improve  the 

results  of  the  BOR-FDTD  analysis. 


5  Conclusion 


A  common  discontinuity  in  a  coaxial  system  occurs  when  the  dimensions  of  the  line  are  scaled  up,  while 
characteristic  impedance  is  maintained.  The  line  will  no  longer  be  reflectionless,  in  general.  It  was  shown  how 
a  BOR-FDTD  formulation  of  a  coaxial  system  with  a  stepped  coaxial  discontinuity  was  used  to  optimize  the 
transition  in  the  system.  The  predicted  optimum  value  of  the  analysis  (6  =  1.7mm),  as  well  as  the  predicted 
optimum  reflection  coefficient  (between  -35dB  and  -32dB  over  the  6GIIz  frequency  band)  was  found  to  be 
consistent  with  measurements,  which  yielded  an  optimum  value  of  i5  ss  l.Gmin  and  a  reflection  coefficient 
between  -40dB  and  -33dB  over  the  30MHz  to  6GHz  band. 

A  BOR-FDTD  formulation  for  wave  propagation  inside  a  coaxial  system  containing  a  dispersive  dielectric 
material  was  discussed.  The  Debye  model  for  a  dispersive  dielectric  material  was  sucessfuliy  incorporated 
in  the  BOR-FDTD  formulation,  using  the  discreti.sed  convolution  integral.  The  predictions  of  the  code  for 
distilled  water  compare  well  in  the  frequency  domain  to  measured  data  and  the  numerical  predictions  of  a 
transmission  line  equivalent  circuit. 

Appendix  A 

Dispersive  dielectric  materials  can  be  modelled  by  u.se  of  the  convolution  integral  constitutive  relationship 
between  the  displacement  flux  and  the  electric  field, 

D(x,/)  =  £oec«E(x,/) -F  /  £:oX(-)E(x.t  -  r)d'  (1) 

Jn 

£r(tv)  =  Sr(cc)  +  ,\;(w)  (2) 

x(")  is  determined  as  the  Fourier  transform  of  the  frequency  dependent  susceptibility  function  x(w).  Kunz 
and  Luebbers  [4]  discretise  this  integral  to  use  in  the  finite  difference  Maxwell  equations, 

D"+i(f,j)  -  D"(i,i)  =  so(e^  +  (3) 

where 

Ax'"  =  x'"  -  (4) 

rj-l 

jIAx"*  (5) 

m=0 
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i,j  and  n  are  the  indexes  for  the  p,  z  and  time  dimensions  respectively. 

The  convolution  sum  in  Equation  (3)  is  dependent  on  the  time  domain  susceptibility  function.  If  the  su^ 
ceptibility  function  has  an  exponential  time  dependence,  the  convolution  sum  can  be  written  as  a  tecursive 
tola.  Recursive  formulas  are  derived  by  Kuns  and  Luebbers  fo,  Debye  and  Lorents  dispers.ve  media  [4, 
pp.  127-145].  The  formulas  for  a  Debye  medium  are  repeated  here  for  reference. 

Debye  model  for  permittivity  _  c- 

Discretised  time  domain  susceptibility  function 

X(7iAr)  =  u(nAr)  (V 


Recursive  property 


=  A.Y"‘e 


Recursion  formula 


;■)  =  (i,  j)  +  e-^  4'”-  ‘(i,  i) 
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Hardware/Software  Codesign  Model  for  Xpatchf 
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Air  Force  Institute  of  Technology 


1.  Introduction 


1.1  Background 

The  Xpatch  [1]  software  package  generates  very  accurate  field  scattering  predictions.  At  present,  Xpatch  has  over 
350  users  worldwide  with  that  number  growing  every  day.  In  addition,  extensive  research  has  been  conducted  to  validate 
the  accuracy  of  the  Xpatch  software  package  [2]-[4].  Thus,  given  a  detailed  description  of  an  object  and  a  specific  viewing 
angle,  Xpatch  can  accurately  predict  the  object’s  scattered  field. 

'ae  Xpatch  software  package  consists  of  two  programs,  Xpatchf  and  Xpatcht.  These  programs  are  for  frequency 
domain  analysis  and  time  domain  analysis  respectively.  The  research  presented  in  this  paper  focuses  on  the  frequency 
domain  version  and  produces  a  fimctional  codesign  model  of  the  Xpatch  software.  Therefore,  the  products  of  this  research 
are  the  behavioral  description  of  the  hardware  and  the  specification  of  the  interface  between  the  software  platform  and  the 
hardware  elements.  The  ultimate  goal  is  an  overall  speed-up  of  prediction  time. 

1.2  Assumptions 

Since  extensive  validation  of  the  Xpatch  system  has  already  been  conducted,  this  research  does  not  validate  Xpatch 
as  a  scattered  field  prediction  program.  However,  the  codesign  system  was  tested  to  ensure  operation  is  consistent  with  the 
existing  Xpatchf.  Further,  the  software  platform  is  assumed  to  be  a  single,  general  purpose  microprocessor 
communicating  with  the  hardware  components  through  shared  memory.  In  order  to  exercise  Xpatchf  in  a  realistic 
fashion,  three  separate  objects  were  selected.  These  objects;  a  boxfin,  a  turkey,  and  an  aircraft  [1],  provided  a  realistic 
exercise  of  the  Xpatchf  run-time. 

13  Approach 

This  research  consists  of  two  parts.  First,  the  existing  Xpatchf  software  was  analyzed  and  profiled  to  reveal  areas  of 
optimization.  Then,  based  on  the  results  of  the  profiling  analysis,  the  existing  software  system  was  partitioned  between 
hardware  and  software. 

13.1  Profiling 

The  Xpatchf  software  was  profiled  in  several  ways.  All  methods  entailed  the  scattered  field  prediction  of  a  variety  of 
objects.  These  objects  were  sufficiently  complex  so  as  to  vigorously  exercise  the  Xpatchf  computation  engine. 

Extensive  run-time  analysis  was  used  to  determine  which  subroutines  and  functions  were  used  most  often  and  to 
identify  periodicity  in  Xpatchf  execution,  Periodicity  or  cyclic  operation  would  be  used  to  optimize  the  execution  of  the 
system  by  allowing  individual  cycles  to  operate  concurrently.  Each  subroutine  and  function  was  assigned  a  unique 
identification  number.  This  number  was  used  to  track  the  sequence  of  subroutine  and  function  calls  during  run-time. 

Figure  1  shows  a  possible  Xpatchf  profile.  The  horizontal  axis  represents  time,  and  the  vertical  axis  represents 
different  functions  or  subroutines  that  are  called  during  Xpatchf  operation.  If  the  cycle  shown  in  Figure  1  can  be  executed 
concurrently,  the  time  for  the  3  cycles  can  be  collapsed  down  to  the  time  for  only  one  cycle.  Profiling  also  identified  data 
interdependencies.  In  order  to  take  advantage  of  the  periodic  nature  of  the  software,  the  data  being  manipulated  in  each 
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cycle  must  be  independent.  Several  areas  of  the  Xpatchf  software  were  identified  as  periodic.  Further  analysis  was 
conducted  to  determine  which  area  would  benefit  the  most  &om  implementation  in  hardware. 


Cydieggeration 


Time 


Figure  1.  Periodic  Nature  of  a  Program 


1 3.2  Behavioral  Description 

Once  an  area  of  the  Xpatchf  software  was  identified  as  a  candidate  for  hardware  implementation,  the  functionality  of 
that  area  was  described  using  the  Very  High  Speed  Integrated  Circuit  (VHSIC)  Hardware  Description  Language  (VHDL). 
VHDL  was  used  to  obtain  a  structural  decomposition  of  the  source  code  functionality.  The  structural  decomposition  was 
made  in  the  context  of  possible  hardware  implementation  strategies.  Attention  was  given  to  obtain  the  greatest  advantage 
of  concurrency.  Additionally,  the  interface  between  the  VHDL  components  and  the  revised  Xpatchf  software  was 
specified.  Once  the  VHDL  model  was  obtained,  the  model  was  verified  for  correct  operation  using  real  xpatchf  inputs. 

2.  Software  Profiling 

As  explained  in  another  paper  in  this  volume,  the  Xpatchf  program  flow  was  determined  by  profiling  the  software  in 
the  run-time  environment.  This  was  done  by  tracking  function  and  subroutine  calls  in  both  the  FORTRAN  code  and  the  C 
code.  Since  most  of  the  C  code  functions  exist  as  library  functions,  the  melding  of  FORTRAN  subroutine  calls  with  the  C 
code  function  calls  presented  a  major  difficulty.  There  was  no  easy  way  to  determine  the  temporal  relationship  between 
the  FORTRAN  and  C  source  code.  The  only  alternative  was  a  line-by-line  analysis  of  the  source  code  to  determine  the 
subroutine/function  hierarchy.  These  temporal  relationships  were  used  to  identify  the  Xpatchf  functionality  that  would 
achieve  the  most  benefit  ft^om  a  hardware  implementation.  The  results  of  that  analysis  are  summarized  in  Table  1  and 
shown  pictorially  in  Figure  2. 


Table  1.  Run-time  Percentages 


II  Obiect 
r Borfin 

0.10% 

1st  Bounce 

11.16% 

Multi-Bounce  I 
87.97% 

Turkey 

0.63% 

2.94% 

95.56% 

1!  VI  Aircraft 

12.85% 

42.37% 

42,17% 
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Figure  2.  Run-time  Summary 


The  times  of  execution  are  shown  as  percentages  of  the  total  time  of  execution.  This  provides  a  normalized  view  of 
the  relative  time  spent  in  each  phase.  It  should  be  noted  that  not  all  of  the  run-time  is  accounted  for  in  the  above  table. 
The  pre-  and  post-processing  are  not  shown  and  therefore,  the  percentages  of  each  row  do  not  add  up  to  100%.  Since  the 
SBR  portion  of  the  code  required  the  largest  portion  of  the  run-time,  it  was  decided  that  this  area  would  most  benefit  from 
a  hardware/software  codesign  implementation. 

3.  VHDL  Model 

Since  the  calculations  involved  in  determining  a  ray/facet  intersection  are  independent  of  other  intersection 
calculations,  the  processing  of  many  intersections  may  proceed  simultaneously.  This  fact  is  used  to  provide  a  significant 
speed-up  in  the  Xpatchf  processing  of  the  multi-bounce  calculations.  As  a  result,  the  existing  Xpatchf  program  will  be 
modified  to  operate  with  the  codesign  model  hardware  developed  during  this  research. 

The  method  used  to  implement  the  xpatchf  calculations  is  known  as  Single  Program,  Multiple  Data  or  SPMD.  The 
same  program  operates  concurrently  on  several  sets  of  input  data  in  order  to  produce  several  sets  of  results  all 
simultaneously  (or  very  nearly  so).  Figure  3  illustrates  the  concept.  The  input  data  \s  flowed  across  the  top  of  the  array  of 
processing  units.  Once  the  data  is  properly  positioned,  each  processing  unit  performs  its  computations.  The  results  are 
flowed  out  in  the  same  marmer  as  the  data  was  flowed  in.  With  the  abstract  functionality  described,  the  particulars  of  the 
implementation  may  be  discussed. 

At  the  top  level  there  is  a  model  that  includes  a  general  purpose  microprocessor  and  associated  RAM/ROM.  This 
microprocessor  is  responsible  for  providing  overall  control  of  the  processing  units  to  include  data  flow  control.  This  takes 
care  of  the  input  data  flow  plane  and  the  output  data  flow  plane.  The  processing  units,  called  Voxel  Units  (VU),  consist  of 
three  modules  each.  The  first  module,  called  the  Global  Memory  Module  (GMM),  is  used  to  store  “global”  object  data. 
The  second  module,  called  the  Local  Memory  Module  (LMM),  provides  the  processing  necessary  to  traverse  the  BSP  tree 
as  it  is  stored  in  the  GMM  and  to  retrieve  facet  data  once  a  leaf  voxel  is  found.  The  third  and  last  module,  called  the 
Computation  Module  (CM),  provides  the  floating  point  processing  necessary  to  implement  the  intersection  calculations 
identified  for  the  codesign. 
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Figure  3.  SPMD  Operation 


3.1  Microprocessor  Controller 

The  microprocessor  controller  is  responsible  for  overall  control  to  include  data  flow  control.  The  microprocessor  can 
be  any  general  purpose  processor  such  as  one  of  the  PowerPC  60X  series.  While  it  is  easy  to  talk  about  simply  flowing 
data  across  the  processing  units,  the  actual  implementation  must  be  made  in  the  context  of  finite  data  bus  sizes  and  finite 
time  required  for  data  transfers.  Iromediately  after  the  object  facet  data  is  loaded  by  the  modified  Xpatchf  program,  the 
pertinent  object  data  is  broadcast  to  each  VU,  specifically  to  each  GMM.  The  object  data  is  maintained  in  the  Xpatchf 
program  in  an  abstract  data  type  (ADT)  called  data.  Only  the  data  relevant  to  the  object  need  be  copied  to  the  VUs. 
Since  the  data  is  maintained  and  managed  by  the  C  library  functions,  pointers  are  used  for  arrays  while  discrete  values  are 
contained  in  memory  allocated  at  run-time.  In  addition  to  the  dynamic  arrays  and  discrete  memoiy  elements,  the  ADT 
data  contains  a  pointer  to  the  head  of  the  Binary  Space  Partition  (BSP)  tree. 

3.2  Global  Memory  Module  (GMM) 

The  GMM  will  consist  of  dynamic  RAM  and  cache  memory.  Both  of  these  items  are  available  firom  industry  and  are 
extremely  reliable,  therefore  no  model  is  provided  for  this  unit  except  for  that  which  is  necessary  to  demonstrate  the 
function  of  the  other  modules.  Once  the  appropriate  global  data  is  in  place  in  the  LMM  and  the  CM,  toe  Voxel  Unit  (VU) 
array  remains  in  an  idle  state  until  toe  modified  Xpatchf  program  makes  a  call  for  ray/object  intersections. 

When  intersection  calculations  are  required,  toe  modified  Xpatchf  program  signals  toe  VU  array  (by  writing  to  a 
predetermined  address).  The  signal  is  followed  by  toe  transfer  of  ray  data  to  toe  VU  array.  The  ray  data  consists  of  toe  ray 
start  point  and  toe  ray  direction  vector  (a  unit  vector).  One  ray  is  assigned  to  each  VU  along  with  toe  ID  of  toe  last  facet 
hit  by  toe  ray  (meaningless  for  initial  rays).  The  ID  of  toe  last  facet  hit  ensures  that  this  facet  is  not  processed  for  the  next 
intersection. 

A  possible  method  of  flowing  toe  ray  data  to  all  of  toe  VUs  is  shown  in  Figure  4.  Assuming  that  there  are  four 
separate  data  channels  to  toe  VU  array,  a  set  of  4  ray  data  elements  may  be  transferred  to  toe  VU  array  at  a  time.  For 
brevity’s  sake,  assume  toe  array  is  4  wide  and  oidy  4  deep.  With  every  memoiy  cycle,  4  ray  data  elements  are  transferred 
to  toe  VU  array.  The  horizontal  lines  in  toe  Data  Flow  portion  of  Figure  4  represent  computational  processing  and  are 
shown  to  last  longer  than  4  memory  cycles.  It  will  be  shown  later  that  this  is  indeed  toe  case.  In  fact,  the  computational 
processing  will  last  much  longer  than  4  memory  cycles.  The  first  results  (a  set  of  4  result  elements)  will  be  available  after 
toe  first  row  of  VUs  finish  their  computations.  Let  this  time  be  tc  memory  cycles.  Then,  for  toe  next  4  memory  cycles,  4 
result  elements  will  be  available  to  toe  main  program.  In  general,  for  n  blocks  of  16  rays  each,  toe  processing  time  in 
memory  cycles  is  given  by  + 1)  +  2 . 
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Figure  4.  Data  Flow  Relationships 


The  number  of  blocks  can  be  specified  in  terms  of  the  total  number  of  rays.  If  the  number  of  rays  is  k,  then  n  satisfies 
K  =  j" — j  and  therefore  + 1)  +  2 .  This  is  an  upper  bound  on  the  total  computation  time  for  the 


VU  array.  It  is  evident  from  this  equation  that  for  k»  16  and  tc »  1,  the  total  time  can  be  approximated  by  = 


16’ 


However,  the  total  time  to  compute  k  rays  sequentially  is  simply  ktc  (assuming  the  sequential  computation  time  and 
concurrent  computation  time  for  one  ray  are  about  equal).  From  this  analysis,  the  speed-up  obtained  is  approximately 
equal  to  the  number  of  VUs  in  the  VU  array.  The  above  equation  represents  an  upper  bound  on  the  computation  time  that 
can  never  be  achieved,  but  for  a  first  order  approximation,  it  is  useful  to  show  the  relationship  between  speed-up  and 
number  of  VUs. 


33  Local  Memory  Module 

The  Local  Memory  Module  (LMM)  provides  memory  management  for  the  VU.  When  data  is  required  by  the 
Computation  Module  (CM),  the  LMM  generates  the  addresses  for  that  data.  The  controller  for  the  LMM  is  implemented 
using  microcode.  Another  feature  of  the  LMM  is  the  implementation  of  a  hardware  stack.  This  is  included  in  order  to 
allow  traversal  of  the  BSP  tree  when  looking  for  leaf  voxels. 

3.4  Computational  Module 

The  CM  performs  the  calculations  necessary  to  determine  the  path  throught  the  BSP  tree  and  whether  a  ray  intersects 
the  object.  Once  a  leaf  voxel  is  reached,  the  CM  performs  the  ray/facet  intersection  calculations  for  each  facet  in  the  leaf 
voxel.  When  all  facets  have  been  processed,  the  CM  outputs  the  results  of  those  calculations.  Those  results  include 
whether  a  hit  was  made,  the  coordinates  of  the  closest  hit  point,  the  distance  from  the  ray  origin  to  the  hit  point,  and  the 
ID  of  the  facet  invcV  in  the  intersection. 
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3.5  Voxel  Unit 

The  Voxel  Unit  (VU)  combines  the  Global  Memory  Module,  Local  Memory  Module  and  Computational  Module 
together  and  provides  the  data  and  address  routing  necessary  for  proper  operation.  Figure  5  shows  a  block  diagram  of  the 
VU. 
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Figure  5.  Voxel  Unit 


The  combinational  logic  manages  the  routing  of  data  and  address  signals  within  the  VU.  Addresses  are  provided  to 
the  GMM  by  the  microprocessor  controller  when  loading  global  data.  When  traversing  the  BSP  tree  or  computmg  a 
ray/facet  intersection,  the  addresses  to  the  GMM  are  provided  by  the  LMM.  Data  is  written  to  the  GMM  by  the 
microprocessor  controller  when  loading  global  data,  otherwise  data  is  read  by  the  LMM  when  traversing  the  BSP  tree  or 
data  is  read  by  the  CM  when  a  ray/facet  intersection  is  to  be  calculated.  Data  is  provided  to  the  CM  by  the  mcroprocessor 
when  a  ray  is  assigned  else  data  is  provided  by  the  GMM  in  response  to  LMM  addresses  when  a  leaf  voxel  is  found. 

4.  ValidationA^erification 

Testing  was  accomplished  using  test  address  data  and  actual  Xpatchf  data  generated  from  a  run  on  the  boxfin  object 
This  data  was  used  to  verify  the  proper  operation  of  the  Local  Memory  Module  (LMM)  and  the  Computation  Module 
(CM)  The  LMM  was  tested  by  verifying  address  outputs  against  expected  values  of  data  addresses.  The  CM  was  tested  by 
applying  known  data  and  checking  results  against  known  good  data  from  the  Xpatchf  run.  The  data  used  for  verification 
included  the  bounding  box  for  the  boxfin  object,  data  for  two  rays,  and  data  for  four  facets  with  results  of  two  hits  and  six 
misses  The  Voxel  Unit  (VU)  was  created  by  integrating  the  LMM  and  CM  with  a  GMM  simulation.  The  VU  was  then 
tested  by  performing  a  simulated  run  with  the  available  data.  Again,  the  results  obtained  were  verified  against  known 
good  data. 

5.  Results  and  Conclusion 


5.1  Performance 

Performance  for  the  VHDL  model  is  measured  relative  to  the  source  code  being  implemented,  using  floating  point 
(FP)  operations  as  the  metric.  The  number  of  FP  operations  in  the  source  code  implemented  in  the  VU  model  is 
dependent  on  the  depth  of  the  BSP  tree,  die  number  of  leaf  voxels  processed,  and  the  number  of  facets  in  a  leaf  voxel. 

The  function  of  processing  a  ray  can  be  divided  into  4  parts.  These  parts  and  the  associated  number  of  FP  operations 
are  summarized  in  Table  2. 
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Table  2.  FP  Operations  for  Ray  Processing 


Compute: 

Xpatchf 

VU  1 

Preprocessing 

27 

28 

BSP  Traversal  (per  level) 

6 

9 

Intersection  Processing  (per  facet) 

33 

30 

II  Post  Processing 

7 

7 

In  general,  the  number  of  FP  operations  for  the  Xpatchf  source  code  is  given  by  -  34+6t  +  {33k)  j,  and 

the  number  of  FP  operations  for  the  VU  model  is  given  by  Xy^j  =35  +  9i  +  (30it)  j  where  i  is  the  number  of  levels 
traversed,  j  is  the  number  LVs  processed,  and  k  is  the  number  of  facets  in  each  of  the  LVs. 

The  MFLOPS  (millions  of  FP  operations  per  second)  of  the  CM  is  assumed  to  be  approximately  equal  to  that  for  a 
general  purpose  workstation.  This  assumption  is  based  on  the  fact  that  the  CM  is,  in  reality,  a  specialized  floating  point 
processor  and  is  expected  to  have  FP  processing  capabilities  on  par  with  commercially  available  FP  processors.  The 
features  which  set  the  CM  apart  from  general  purpose  FP  processors  are  the  hardware  stack  and  the  special  looping 
capability.  The  advantage  of  the  VU  architecture  is  the  ability  to  array  the  units  to  take  advantage  of  concurrent  operation. 
If  it  is  assumed  that  the  MFLOPS  rating  for  the  VU  model  is  the  same  as  that  for  a  general  purpose  workstation,  then  the 
execution  time  of  the  Xpatchf  source  code  can  be  compared  to  the  execution  time  of  the  VU  model  for  one  ray. 

This  discussion  has  shown  that  the  performance  of  the  VU  model  for  one  ray  is  approximately  equal  to  that  of  a 
general  purpose  microprocessor  executing  the  Xpatchf  source  code.  However,  the  advantage  of  the  VU  model  is  that  it 
may  be  arrayed  and  many  VUs  operated  concurrently.  If  arrayed,  the  MFLOPS  is  multiplied  by  a  factor  approximately 
equal  to  the  number  of  VUs  included  in  the  array.  The  result  woxild  be  a  “card”  with  all  the  components  necessary  to 
implement  one  array  of  VUs.  The  components  would  include  the  microprocessor  controller  and  its  associated  memory 
and  the  array  of  VUs  (such  as  the  4  x  4  array  as  shown  in  Figure  4).  From  this  arrangement,  the  approximate  speed-up 
kt 

would  be  16  from  In  general,  if  an  n  x  m  array  is  used,  the  speed-up  would  be  approximately  nm.  The  n 

16 

dimension  is  limited  to  the  number  of  separate  memory  channels  between  the  workstation  main  memory  and  the  VU 
array.  The  m  dimension  is  limited  by  the  processing  time  of  a  VU  element  and  the  time  required  to  transfer  ray  data  from 
the  workstation  and  the  VU  array. 

A  set  of  VU  array  “cards”  could  more  efficiently  use  the  available  memory  bandwidth  and  increase  speed-up.  This  is 
possible  since  the  a  VU  spends  a  majority  of  its  time  doing  computations  and  not  memory  accesses.  If  8  cards  were 
included  in  one  workstation,  the  speed-up  would  be  approximately  128  or  2  orders  of  magnitude  improvement  in 
processing  time. 

Additional  speed  up  could  be  obtained  by  networking  these  workstations.  If  N  “Xpatchf’  workstations  were  cotmected 
in  a  network,  the  speed-up  would  be  128iV.  This  figure  is  obtained  for  a  system  where  communication  costs  are  zero. 
However,  in  reality  there  would  be  a  time  penalty  associated  widi  every  message  passed  between  workstations.  When 
considering  performance  of  a  system  that  incorporates  message  passing,  the  metric  used  must  be  execution  time.  Speed¬ 
up  is  related  to  execution  time  given  by  = - — -  where  is  the  time  for  the  original  system.  When 

speed-up 

considering  the  delays  associated  with  message  passing,  the  execution  time  will  increase  by  an  amount  proportional  to  the 
number  of  messages  passed.  In  the  case  of  an  Xpatchf  “network,”  message  passing  would  only  be  necessary  to  assign  ray 
data  elements  and  signal  completion  of  ray/object  computations.  Other  message  passing  would  be  necessary  to  initialize 
each  workstation  with  common  data  (associated  with  the  object).  However,  this  should  not  be  considered  as  part  of  the 
ray/object  computation  time  since  the  system  could  be  initialized  once  with  object  data  and  reused  for  multiple  incident 
angles;  i.e.,  calculating  multiple  predictions.  Messages  would  be  necessary  to  pass  new  angles  to  each  workstation.  Delays 
associated  with  these  messages  are  negligible  compared  to  the  number  of  messages  associated  with  assigning  ray  data 
elements. 

Consider  a  star  network  topology  with  one  controlling  workstation  and  -  7  other  nodes.  Rays  are  fired  from  a 
virtual  screen  similar  to  that  used  for  the  Z-buffer.  Therefore,  the  natural  choice  is  to  divide  the  screen  into  N  blocks  and 
assign  each  node  a  single  block.  This  approach  will  be  used  for  the  remainder  of  this  discussion,  but  it  should  be  evident 
that  this  is  not  the  most  optimum  choice.  There  is  a  high  probability  that  entire  blocks  of  rays  would  miss  the  object 
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completely.  This  would  result  in  some  nodes  setting  idle  for  long  periods  of  time  while  others  continued  processing 
computations.  A  method  of  load  balancing  would  alleviate  this  problem.  Since  this  is  outside  the  scope  of  this  research, 
no  further  discussion  wiU  be  given.  The  simplistic  network  model  wiU  be  used  to  demonstrate  the  commumcation  costs  of 

a  network  implementation.  ,  ,  .  j  ... _ j  „ 

The  number  of  messages  associated  with  assigning  rays  is  iV  -  1  since  flie  control  workstation  d^s  not  nwd  a 
message.  If  workstations  only  pass  results  when  all  rays  assigned  have  been  processed,  there  will  be  an  addition^  N  -  1 
completion  messages.  It  is  also  assumed  that  multiple  bounces  of  a  ray  are  processed  on  the  workstation  to  which  the 
original  ray  assignment  was  made.  The  total  number  of  messages  is  UN  -  1).  The  new  processmg  time  is  give  by 


f  =  ioid.  +  2a(N  -1).  The  equation  is  generalized  for  a  speed-up  of  s. 

sN 

The  parameter  a  is  a  constant  of  proportionality  that  allows  the  message  delay  conttibution  to  be  incorporated  into 
the  equation  for  the  new  time  of  execution.  The  value  for  a  is  unknown  but  could  be  obt^ed.  It  represents  the  magnitude 
of  the  effect  of  message  passing  on  the  networked  system  performance.  Since  the  desire  is  to  mmimize  executioii  tune,  the 
optimum  number  of  nodes  can  be  found  by  setting  the  first  derivative  of  the  above  equation  to  zero  and  solving  for  N. 

This  solution  is  N  =  .  It  can  be  observed  from  this  solution  that,  as  the  processing  time  increases,  the  number  of 

\2sa 

nodes  must  be  increased  to  maintain  optimum  performance.  However,  as  a  increases,  the  number  of  nodes  must  be 
reduced  to  maintain  optimum  performance. 


5.2  Conclusion 

The  research  described  in  this  paper  resulted  in  an  architectural  model  for  improving  Xpatchf  processing.  The 
optimization  was  accomplished  by  implementing  highly  repetitive  computations  in  a  VHDL  model.  Because  of  *e  nature 
of  the  computations  being  implemented  in  VHDL,  the  research  product  has  applications  outside  of  scattered  field 
predictions.  The  implementation  is  the  foundation  for  a  ray  tracing  algorithm.  Therefore,  the  area  of  possible  use  is 
expanded  to  the  graphics  processing  arena  and  may  prove  to  be  very  useful  for  speeding  up  graphics  processmg 
applications. 
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I.  Introduction 

The  location  of  buried  waste  by  some  sort  of  ground  penetrating  EM  method  would  be  a 
valuable  tool  in  locating  hazardous  materials  that  would  be  both  difficult  and  dangerous  to  find  by 
invasive  methods.  One  approach  under  consideration  is  the  use  of  RF  applicators  to  launch  EM 
pulses  whose  echoes  can  be  analyzed  to  determine  if  a  foreign  object  lay  beneath  the  ground. 

Such  an  approach  is  plagued  by  numerous  practical  and  theoretical  difficulties:  unknown 
terrain  under  the  surface,  lossy  materials  in  the  terrain  which  attenuate  the  signal,  the  fact  that  the 
buried  objects  can  be  of  unknown  size  and  composition,  etc.  Because  of  the  complexity  of  the 
problem,  it  is  believed  that  computer  simulation  can  play  an  invaluable  role  in  finding  the  best 
solution  to  overcome  these  difficulties. 

Using  the  finite-difference  time-domain  (FDTD)  method  [1,  2,  3],  programs  have  been 
developed  to  simulate  a  plane  EM  wave  penetrating  into  the  earth  and  reflecting  off  various  objects. 
Although  any  practical  applicator  will  ultimately  generate  a  signal  other  than  a  plane  wave,  it  is 
believed  that  this  is  a  valid  approach  in  gaining  insight  into  the  problems.  The  program  can  simulate 
any  material  over  a  wide  frequency  range,  and  scattered  objects  can  consist  of  canonical  shapes  or 
arbitrary  shapes  determined  at  run  time.  Furthermore,  analytic  programs  using  Bessel  function 
expansions  have  been  developed  to  verify  the  accuracy  of  these  models.  This  Bessel  function 
method  has  been  enhanced  to  be  able  to  verify  simulations  of  objects  in  lossy  materials. 

II.  Problem  Description 

The  basic  simulation  is  described  in  figure  1.  The  problem  space  is  divided  up  into  a 
scattered  field  and  total  field.  An  incident  plane  wave  is  generated  at  one  end  and  subtracted  out  the 
other  end.  Therefore,  anything  that  goes  outside  the  boundary  must  be  the  result  of  a  scattered 
wave.  Note  that  two  or  more  different  background  media  may  be  used.  Figure  1  illustrates  the  case 
where  the  incident  wave  originates  in  free  space,  and  penetrates  into  the  ground.  The  boundaiy^  of 
the  problem  space  is  surrounded  by  a  perfectly  matched  layer  (PML).  It  is  a  slight  modification  to 
the  Berenger  PML  which  is  completely  impervious  to  the  background  medium.  It  will  be  described 
in  the  following  section. 

The  object  in  the  total  field  is  the  target  object  which  is  being  sought.  The  model  for  this 
object  is  loaded  from  an  external  file,  which  is  specified  at  run  time.  These  models  can  be  generated 
'off-line,'  and  can  be  modified  or  even  created  just  using  a  text  editor.  (This  borrows  from  previous 
experience  in  hyperthermia  treatment  planning  where  a  model  of  a  patient  is  created  and  stored,  and 
then  read  in  by  the  program  at  run  time  [4].) 
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Figure  1.  Problem  space  with  a  rectangular  scattered 

The  incident  field  can  also  be  generated  at  any  angle  between  zero  and  ninety  degrees,  as 
illustrated  in  figure  2.  However,  this  is  only  valid  for  a  single  background  The  incident  field  can 
also  be  generated  at  any  angle  between  zero  and  ninety  degrees,  as  illustrated  in  figure  2. 
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in.  PML 


The  absorbing  boundary  conditions  for  this  FDTD  approach  is  handed  by  a  perfectly 
matched  layer  (PML)  which  is  a  slight  modification  of  the  original  developed  by  Berenger  [5].  The 
details  have  been  given  elsewhere  [6],  and  will  only  briefly  by  described  here. 

For  the  purpose  of  the  FDTD  formulation,  the  Maxwell's  equations  are  take  in  the  form 


1 


-  1 — 

(l.a) 

dt 

D(o})  =  e* (co)‘  Ei(o) 

(Lb) 

(l.c) 

dt 

The  type  of  formulation  in  equations  1  is  used  because  dispersive,  or  even  nonlinear, 
properties  of  the  material  are  all  contained  in  equation  l.b.  (Assuming  a  non  magnetic  material). 
Numerous  approaches  have  been  taken  to  solve  E  from  D  in  l.b  ,  however,  that  is  not  germane  to 
the  present  discussion.  What  is  important  is  that  the  dielectric  properties  of  the  material  being 
simulated  are  expressed  in  equation  l.b.  Regardless  of  how  complex  the  material  is,  it  will  not 
impact  the  implementation  of  the  PML  in  equations  l.a  and  l.c. 

Of  the  six  differential  equations  represented  within  equations  l.a  and  l.c,  the  method  will  be 
illustrated  by  using  only  the  D  and  H  components  in  the  z  direction 


A.  1 

dt  \  dx  dy) 

(2.a) 

A  = 

(2.b) 

II 

1 

o  1 

•S’  (S. 

(2.C) 

Berenger's  approach  was  to  introduce  fictitious  electric  and  magnetic  directional  conductivites,  and 
to  split  the  E  and  H  fields  in  each  direction  into  sub  components  depending  on  the  direction  of  the 
spatial  derivative  used  to  calculate  it.  We  will  deviate  from  this  only  slightly  by  introducing  fictitious 
conductivites  associated  with  H  and  D,  instead  of  H  and  E 
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These  fictitious  conductivities  are  in  addition  to  any  "real"  conductivity  associated  with  the  complex 
portion  of  e/fco ) .  Putting  equations  3. a  and  3.d  in  an  FDTD  formulation  (3.b  and  3.e  would  have 
the  same  form): 


+  1/ 2)  =  gj(j)  j,k  +  \l2) 

+  0. 5  y  + 1  /  2,  A:  + 1  /  2)  -  j-l/2,k  +  \/2) 

+  H^’‘^^^aj  +  l/2,k+\/2)-H^''^'^^(i,j-V2,k  +  l/2)) 

///"‘'Vi  + 1/  2,  ;•  + 1  /2,A:;  =  +  l/2,j  +  l  /2,k) 

-0.5-(E;(i  +  l,j  +  l/2,k)-E;(iJ  +  l/2,k)) 

The  PML  is  formed  from  the  parameters 

T-o^,(i) 


gi(i)  =  \i~ 


Ji(i+l/2)=\l- 


T^aJi  +  l/2) 
Aio 


(4.a) 


(4.b) 


(5.a) 


(5.b) 


In  the  Berenger  formulation,  values  of  and  cr^  are  increased  as  one  goes  further  into  the 
PML  to  give  greater  absorption.  Therefore,  the  terms  fl  ~T-o^J  and  {\-T'  \i^)  will 

become  increasingly  smaller,  so  gj,  and  Ji  go  from  1  at  the  edge  of  the  PML,  to  a  smaller  number  at 
the  outer  boundary  of  the  FDTD  space.  (Using  .5  as  the  smallest  value  has  been  found  to  be  the 
most  effective  without  leading  to  instabilities.)  For  instance,  if  a  PML  of  n  layers  were  being 
developed  for  the  X  direction  at  the  lower  end,  the  values  of  gi  and  fi  would  be  computed  as 

gzYi)  =  1-0.5- 

\  n 


\  n 


i=0,l,..,n 

I 

i=0,l,..,n-l 


(6.a) 

(6.b) 


The  values  at  the  high  end  would  be 


SHimax  ~  0=  1-0.5 


\  , 


i=0,l,.,,n 


fi(L 


,-z  +  l/2)  =  l-0.5-f”~^~‘^)  i=0,l,..,n-l 


(7.a) 

(7.b) 


This  PML  has  been  found  to  be  extremely  effective,  usually  requiring  no  more  than  four 
cells.  Because  it  is  implemented  throughout  the  problem  space,  it  has  slightly  higher  computer 
memory  requirements.  However,  it  because  it  is  completely  impervious  to  the  medium  of  the 
problem  space,  it  presents  a  substantial  advantage  in  simulations  where  the  background  medium  is 
often  changed. 
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IV.  Comparison  with  Bessel  Function  Expansion 

The  configuration  illustrated  in  figure  3  is  used  to  evaluate  the  accuracy  of  the  method  by 
comparison  with  Bessel  function  expansion.  The  Bessel  program  is  capable  of  operation  in  lossy 
media.  The  background  medium  has  a  dielectric  constant  of  10  and  a  conductivity  of  .02.  The 
sphere  is  2  meters  in  diameter.  The  inner  core  is  metal  with  a  40  cm  diameter.  The  next  layer  is  2 
cm  thick  and  has  a  dielectric  constant  of  50  and  conductivity  of  .2.  The  outer  layer  has  a  dielectric 
constant  of  20  and  a  conductivity  of  .02.  (Values  were  chosen  to  test  the  method  and  do  not 
necessarily  correspond  to  likely  physical  materials.)  The  results  along  the  incident  axis  are  shown 
in  figure  4  for  1,  10  and  30  MHz.  The  plane  wave  is  in  impulse  and  the  results  are  evaluated  by  a 
running  Fourier  transform,  so  information  for  multiple  frequencies  is  available  for  one  run  [7]. 
Clearly,  there  is  good  agreement  for  all  frequencies. 

A  similar  test  is  made  by  evaluating  the  scattering  from  a  sphere  at  various  levels  through  or 
in  front  of  the  sphere,  as  illustrated  in  figure  5.  This  is  potentially  of  more  significance,  because  the 
purpose  of  the  program  is  to  evaluate  reflected  waves.  Results  are  shown  in  figure  6  for  scattering 
off  a  metal  sphere,  w'here  the  comparison  is  made  through  the  plane  of  the  sphere  and  10  cm  in  front 
of  the  sphere. 

V.  Discussion 

A  three  dimensional  simulation  program  has  been  described  which  is  being  used  towards  the 
development  of  an  electromagnetic  method  of  locating  buried  waste.  Emphasis,  thus  far,  has  been 
on  developing  the  flexibility  of  the  programs  and  verifying  their  accuracy.  It  has  been  shown  that 
these  programs  can  operate  with  arbitrar>’  background  media,  and  that  objects  of  any  size  and  shape 
can  be  developed  outside  the  program,  and  then  read  in  at  run  time.  Similarly,  analytic  programs 
using  Bessel  function  expansions  have  been  developed  which  can  determine  the  scattering  from 
layered  spheres  in  any  background  medium. 
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Figure  3.  Configuration  used  to  test  the  FDTD  program 
against  analytic  programs  using  Bessel  function  expansions. 
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Figure  4.  Results  of  FDTD  vs.  Bessel  function  for  a  sphere  in  a  lossy 
background  of  eps  =10,  sigma  =  .02.  The  sphere  has  a  metal  core  and  two 
dielectric  layers.  Results  are  shown  at  1,  10,  and  30  MHz.  The  FDTD  program 
was  in  a  40^  problem  space  and  required  500  time  steps .  The  cells  were  10  cm^. 
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Figure  5.  Diagram  of  sample  problem 
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Figure  6.  FDTD  vs.  Bessel  Function  for  a  metal  sphere,  10  cm  in  diameter  at  1 
and  100  MHz.  Comparison  made  at  the  axis  perpendicular  to  the  incident  field  10 
cm  in  front  of  the  sphere. 
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ABSTRACT 

The  paper  presents  there  different  fast  implementations  of  a  time  domain  diakoptics  technique  applied  to  the 
modeling  of  the  highly  dispersive  ABC.  The  algorithms  discussed  are  based  on  the  digital  filters  using  low  order 
zero-pole,  Laguerre  and  Kautz  models.  The  performance  of  the  most  successful  designs  is  compared  with  the 
Berenger’s  perfectly  matched  layer. 

I.  INTRODUCTION 

Time  domain  numerical  methods  using  finite  differencing  scheme  in  the  modeling  of  uniform  infinite  structures 
require  the  proper  termination  of  computational  space  in  the  plane  where  no  physical  boundary  exists.  Recently  a 
significant  breakthrough  in  this  area  has  been  achieved  by  Berenger  [1]  who  proposed  termination  of  the  compu¬ 
tational  space  with  a  non-physical  lossy  medium  called  the  perfectly  matched  layer  (PML).  It  has  been  found  [2]. 
however,  that  although  the  Berenger’s  perfectly  matched  layer  performance  is  excellent  for  propagating  waves,  it 
deteriorates  drastically  for  evanescent  waves.  This  limits  the  applicability  of  the  PML  to  the  analysis  of  scattering 
phenomena  in  cylindrical  waveguides,  where  nonpropagating  modes  can  be  excited  at  discontinuities. 

In  the  contribution  we  will  discuss  the  application  of  various  structures  of  digital  filters  to  modeling  the  ter¬ 
mination  condition  in  cylindrical  waveguides.  We  found  that  for  the  propagating  waves  the  digital  filters  offer  the 
performance  equal  or  for  some  filter  configurations  even  better  than  PML,  and  yet  remain  effective  in  absorbing 
cutoff  modes. 

The  digital  filters  use  an  idea  known  as  the  time  domain  diakoptics  [3,  4,  6].  Time  domain  diakoptKs  consists 
in  finding  the  impulse  response  of  a  section  of  the  waveguide  and  using  the  convolution  integral  to  predict  value  of 
the  field  at  the  ABC  from  the  value  of  the  field  at  the  input  of  this  section.  This  relation  is  written  in  the  form 

r{t)  =  m*h{t)  (1) 

where  *  is  the  symbol  for  the  convolution  integral  and  h(t)  is  the  impulse  response  of  the  system,  f{t)  is  the  field 
value  at  the  input  of  the  waveguide  section  and  r(f)  is  the  required  field  value  at  the  ABC.  The  numerical  cost  of 
calculating  the  convolution  is  high  (at  least  a  few  hundreds  of  floating  point  operations  per  iteration  step).  The 
synthesis  of  the  ABC  in  the  form  of  a  digital  filters  is  in  fact  is  a  fast  implementation  of  the  time  domain  diakoptics 
method.  In  a  digital  filter  the  convolution  is  replaced  by  fast  recursive  formulae  so  that  the  overall  computing  time 
of  the  time  domain  analysis  can  be  greatly  reduced. 

II.  DIGITAL  FILTER  ALGORITHMS  FOR  TIME  DOMAIN  DIAKOPTICS 

Digital  filters  can  be  realized  in  many  forms.  In  this  contribution  we  discuss  several  implementations  of  digital 
filters  which  stem  from 

1.  non  orthogonal  filters  such  as  zero-pole  (ARX) 

2.  orthogonal  filters  using  Laguerre  and  Kautz-Laguerre  basis 
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1.  Zero-pole  filters 

^  -io„„  f„„c«„„.  The 


''(^)  —  ^  ^if(k  —  i)  —  ^  air(k  -  i) 


tr£=.xri:£=irS“^^^^^ 

2.  Orthogonal  filters 

orthlormaUnTr’  “■^bi-ation  of  basis  functions  d,((), 


h(t)  =  J2ci4>i(t) 


P-duct  of 

to  the  one  discussed  for  zero-pole  filter,  can  provide  the  expansion  coefficients 

e  .-transfer  function  of  the  filter  having  the  impulse  response  hit)  expressed  in  the  form  (3)  is 


where  $,(r)  denotes  the  rr-domain  operator  corresponding  to  the  basis  funriinn  ^  M  t  .•  i 

''’”i:roX®b;::‘'"b.r  r  “^-/obtain*  it:-.r:cd:r:;p"<:;i:Zf“'^ 

recently  been  shosvn  there  is  an^nflnitUrmberTf  TotsibkoiThc^gm^  '’“f  '* 

the  bases  involving  Laguerre  and  Kant7  nnK-nnmittlc  tx  u  \  frequently  used  are 

estimated  parameters  without  increasing  the' computation aUorpI^itVofX'^^^^^^^^ 

tolX  ^PP-imations  of  biters  ndth  real  po.es^ wh.ie  KautTiwi  "0^  ftll^^s 

2.1  Laguerre  basis 

The  z-transform  of  the  Laguerre  polynomial  of  the  f-th  order  is  given  by 

sltTaCf  Z  Z^ZZtlTZZtZL'’'^  p-“- 

Since  the  Laguerre  polynomials  are  orthogonal  then,  using  representation  (3),  the  convolution  (I)  becomes 

00 

■■(<)  =  Eci/(t).i,(t;e)  (gj 

ZZZtZTZ"  »'■(')  =  /«)  •^.■(«)  Vdi)  a.  the  time  instant 


yi(m)  =  ^yi(m-  1)-|- 

y.  (m)  =  xi(m)  -  ^Xi(m  +1)  t  =  2, 3  . . 
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with 


(9) 


Xj(m  +  1)  i  =  2,3  . . . 


2.2  Kautz  basis 

In  this  model  the  Kautz  functions  are  used  as  expansion  functions.  The  2-transform  of  Kautz  functions 
ICi{z-,a,0)  of  the  f-th  order  is  given  by 


fCi{z-,a,l3)  = 


JCiiz;a,l3)  =  - 


(10) 

z^  -t-  I3{a  -  1)2  -  a  J 

■— ’  +  +  ieve„ 

(11) 

2^  +  0{q  -  1)2  -  a  J 

for  parameters  a,/?  G  (-1, 1).  Like  in  the  previous  model,  the  parameters  a  and  13  provide  additional  degrees  of 
freedom  which  allows  one  to  find  a  best  approximation  for  a  given  filter  order. 

The  response  of  the  Kautz  filter  is  given  by; 


*  Ki{t;a,/3)  (L2) 

1=1 

Again,  the  convolution  terms  can  be  computed  recursively.  Denoting  yi(i)  -  f{t)*Ki{t-,  a,/3),  we  have  the  following 
formulae  for  yi{t)  at  the  time  instant  t  =  mAt 

yi(m)  =  +  (1^) 

t/j(m)  =  xf  ^(m) -f  (a  -  l)/?a;p^(m)  -  -I- 1)  i  =  3,4...  (15) 

The  auxiliary  state  variables  x^^xp^  are  defined  as 

x^i^^(m-l-l)  =  xf^(m)  (16) 

+  1)  =  ax^i^^m)  -  (a  -  l),5xf  ^(m)  -f  \/l  -  a-f{m)  (17) 

4\m  +  l)  =  (18) 

x^'l(m-bl)  =  ax^'V)  -  (a  -  l)^xf\m)  +  ^/il  -  a2)(l  -  /?2)/(m)  (19) 

a;P(r,2  +  l)  =  xpl(m)  i  =  3,4...  (20) 

xpl(m-l-l)  =  axpi(m) -(a- l)/?xpl(m)-l-j/i_2(m)  i  =  3,4...  (21) 

III.  IMPLEMENTATION  OF  THE  MODELS 


The  application  of  the  digital  filters  to  the  modeling  of  dispersive  phenomena  involves  two  steps: 

1,  finding  the  model  parameters,  usually  off  line,  prior  to  simulation 

2.  calculation  of  the  convolution  in  the  actual  time  domain  simulations  using  the  digital  filter  model  of  the 
dispersive  problem 

The  first  step  may  be  regarded  as  a  form  of  tuning  of  the  model.  As  indicated  in  the  previous  sections,  the  model 
parameters,  for  the  investigated  class  of  filters,  can  either  be  found  directly  from  a  known  impulse  response  or  using 
the  least  squares  approach.  We  found  that  the  latter  strategy  is  very  efficient  for  the  modeling  of  dispersive  ABC  in 
waveguides.  Prior  to  simulation  we  have  recorded  short  time  sequences  at  two  points  along  waveguide  separated  by 
a  discretization  step  Ad.  These  sequences  were  used  for  ’’tuning”  the  filters.  We  used  the  least  squares  method  to 
find  filter  coefficients.  Once  the  filter  coefficients  have  been  found  the  quality  of  the  model  was  tested  by  exciting  the 
filter  with  the  input  ’’tuning”  sequence  and  comparing  the  filter  output  with  the  recorded  output  tuning  sequence. 
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Parameters  for  the  Laguerre  and  (a,  /?)  for  Kautz  structures  were  found  by  means  of  optimization  procedure 
using  the  Is  error  of  the  tuning  process  as  a  goal  function.  The  model  parameters  were  saved  and  subsequently 
used  for  modeling  ABC  using  the  excitation  signal  which  was  totally  different  from  the  ’’tuning”  signal. 

The  numerical  cost  of  implementing  the  convolution  using  the  digital  filters  depends  on  the  filter  structure  and 
order.  The  costs  of  the  three  models  discussed  in  this  paper  are  compared  in  Table  1. 


Table  1:  Numerical  cost  and  memory  requirements  for  computation  of  convolution  integrals  using  digital  filter 
models  of  order  Imax 


Model 

Zero-pole 

Laguerre 

Kautz 

Number  of  multiplications 

2fmor  +  1 

^Imax  ~  1 

4/moa: 

Number  of  additions 

Imax 

3Inai  ~  2 

5Imax  3 

Number  of  memory  cells 

2Imax  +  2 

^Imax  +  4 

3Lmax  ■+■  1 

It  is  assumed  that  the  filters  have  the  same  order  Imax,  where  Imax  is  the  number  of  expansion  terms  for 
Laguerre  and  Kautz  models  or  the  number  of  zeroes  and  poles  for  the  zero-pole  model.  It  is  seen  that  the  lowest 
cost  in  terms  of  memory  and  computing  time  is  associated  with  a  zero-pole  filter.  The  advantage  of  orthogonal 
filters  is,  however,  the  ability  to  optimize  their  performance  by  means  of  the  free  parameters  and  the  unconditional 
stability  of  their  impulse  response. 


rv.  RESULTS 

As  an  example  of  the  application  of  the  fast  diakoptics  algorithms  we  shall  present  the  numerical  results  obtained 
for  for  modeling  the  ABC  in  TEiq  mode  in  a  10.16  by  22.86mm  rectangular  waveguide.  First,  a  wideband  behavior 
of  the  models  was  examined.  The  filters  were  tuned  using  a  signal  bandwidth  of  30  GHz  and  a  center  frequency 
15  GHz.  Table  2  shows  the  lowest  values  of  the  squared  filter  error  which  were  obtained  during  the  tuning  phase. 


Table  2:  Square  of  the  final  prediction  error  relative  to  output  tuning  sequence  obtained  at  the  end  of  the  tuning 
stage  for  filters  tuned  for  wide  band  (0-30GHz)  operation  ° 


Model  order 

i  M^dii  i 

zero-pole 

Laguerre 

Kautz 

2 

2.8- 10-” 

8.6  ■  10-” 

"1.4  ■ 10-” 

4 

“7.2  - 10“'’ 

1  ■  10-*= 

1.2-  10-' 

8 

3 ■ 10-” 

3- 10-' 

8.6  -  10-“ 

16 

3.2  10-^ 

3.6 -lO-^ 

'  2.6  ■  10-“ 

32 

~4  ■  10-« 

i-io-« 

8.7-10-'’ 

It  is  seen  that  for  the  low  order  filters  the  lowest  errors  are  obtained  for  Kautz  filters  but  differences  between 
the  models  are  not  very  significant.  The  tuning  phase  can  not  provide  the  answer  how  the  model  behaves  for  other 
sigiials.  To  examine  the  performance  of  the  digital  filters  as  an  ABC,  the  model  parameters  corresponding  to  the 
optima  found  at  the  tuning  stage  were  used  for  time  domain  simulations  of  a  infinitely  long  waveguide.  The  guide 
was  excited  with  a  wideband  Gaussian  pulse  (0-60GHz).  Note  that  this  signal  was  entirely  different  from  tuning 
sequence.  In  order  to  measure  of  the  model  performance  we  have  calculated  the  return  loss  based  at  8196  time 
samples  taken  a  few  cells  from  the  ABC.  Figs.1-2  show  the  return  loss  for  the  zero-pole,  Laguerre  and  Kautz  filters 
for  the  filter  order  Imax  =  4  and  Imax  =  16  respectively.  It  is  seen  that  even  for  the  low  approximation  order  the 
results  are  good  as  the  return  loss  is  lower  than  -50,  -60  dB  over  a  very  wide  frequency  band,  with  the  best  results 
obtained  for  the  Kautz  filters.  The  worst  performance  is  seen  near  the  cutoff  frequency,  especially  for  a  low  order 
zero-pole  filter.  Increasing  the  approximation  order  the  quality  of  the  ABC  improves  quickly,  and  the  differences 
between  models  disappear.  It  has  to  be  noted  that  in  practical  simulations  the  most  interesting  is  the  behaviour 
of  the  ABC  m  the  single  mode  band.  For  this  reason  we  created  a  new  set  of  models  optimizing  the  filters  for  the 
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narrow  band  operation.  To  this  end  we  used  a  tuning  signal  with  the  bandwidth  form  6.9  to  13.lGHz.  The  error 
norms  obtained  in  this  process  are  given  in  Table  3.  The  results  for  Kautz  and  Laguerre  filters  are  quite  remarkable. 

Table  3:  Square  of  the  final  prediction  error  relative  to  output  tuning  sequence  obtained  at  the  end  of  the  tuning 
stage  for  filters  tuned  for  narrow  band  (6.9-13.1  GHz)  operation 


Model  order 

Model  i 

zero-pole 

Laguerre 

Kautz 

2 

00 

o 

1.1  10-° 

9.1  ■  10-° 

4 

2.9  10-° 

1.3-10-^ 

1.1-  io-« 

6 

1  •  10-'^ 

2.2-10-“^ 

o 

8 

9 • 10-^ 

1.1 -lo-* 

2.5  - 10"^ 

For  the  filter  orders  as  low  as  4  or  even  2,  for  the  Kautz  model,  the  prediction  errors  are  by  two  or  even  three  orders 
of  magnitude  lower  than  for  the  zero-pole  model.  The  quality  of  this  tuning  was  again  verified  by  calculating  the 
return  loss  of  the  ABC  in  the  single  mode  operation  band  of  the  waveguide.  The  results  are  shown  in  figs  3  and 
4  ■j'he  Kautz  filter  clearly  provides  the  best  results.  For  the  filter  order  4  it  provides  on  average  more  than  15  dB 
improvement  over  zero-pole  and  Laguerre  filters  especially  for  the  lower  frequencies.  Also,  for  this  filter,  increasing 
the  filter  order  form  2  to  4  results  in  approximately  20  dB  improvement  in  the  performance. 

Fig  5  shows  the  comparison  of  the  performance  of  the  PML  and  two  orthogonal  digital  filters  based  on  Laguerre 
and  Kautz  polynomials.  This  time  the  digital  filters  were  tuned  for  the  2-20GHz  operation.  The  Berenger’s  perfectly 
matched  layer  wsis  assumed  to  consist  of  16  layers  and  have  parabolic  profile  of  the  conductivity  and  normal 
incidence  reflection  factor  of  10“^.  For  the  digital  filters  the  model  order  was  16.  For  these  parameters  the  Berenger’s 
ABC  performs  very  poorly  below  cutoff  because  of  the  short  length  of  the  PML  It  is  seen  that  below  cutoff,  both 
types  of  digital  filters  give  a  significant  improvement  over  the  PML,  while  giving  a  similar  level  of  reflection 
coefficient  for  higher  frequencies.  The  Kautz  model  also  outperforms  PML  in  the  high  dispersion  region. 

V.  CONCLUSIONS 

Fast  algorithms  for  time  domain  diakoptics  were  presented.  There  different  implementations  using  low  order 
zero-pole,  Laguerre  and  Kautz  digital  filters  were  described  and  compared.  A  highly  dispersive  absorbing  boundary 
condition  is  used  as  an  example  of  application.  It  was  found  that  very  good  results  can  be  obtained  at  a  marginal 
numerical  cost.  Generally,  the  orthogonal  filters  seem  to  be  superior  for  low  order  models  with  the  Kautz  filter 
providing  particularly  encouraging  results  for  single  mode  applications.  The  Kautz  filters  also  clearly  outperform 
the  PML. 
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Figure  1:  The  return  loss  of  the  ABC  for  a  TE^o  mode  in  a  10.16  by  22.86mm  rectangular  waveguide  and  the  filter 
order  Imax  =  4.  Models  optimized  for  wideband  operation 
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Figure  2:  The  return  loss  of  the  ABC  for  a  TEi^  mode  in  a  10.16  by  22.86mm  rectangular  waveguide  and  the  filter 
order  Imax  =  16.  Models  optimized  for  wideband  operation 


Figure  3;  The  return  loss  of  the  ABC  for  a  TEiq  mode  in  a  10.16  by  22.86mm  rectangular  waveguide  and  the  filter 
order  Imax  =  2.  Models  optimized  for  operation  within  the  single  mode  operation 
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Figure  4:  The  return  loss  of  the  ABC  for  a  TE\q  mode  in  a  10.16  by  22.86mm  rectangular  waveguide  and  the  filter 
order  Imax  =  4.  Models  optimized  for  operation  within  the  single  mode  operation 


Figure  5;  Comparison  of  the  performance  of  the  16-cell  PML  having  R(0)=10-‘*  and  a  parabolic  conductivity 
profile  with  the  the  orthogonal  digital  filters  for  a  TEio  mode  in  a  10.16  by  22.86mm  rectangular  waveguide.  The 
filters  order  was  assumed  to  be  I^ax  =  16 
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Note  on  Large  Crane  Coupling  to  Nearby  AM  Radio  Station 


Paul  W.  Leonard  &  J.  B.  Hatfield 
Hatfield  &  Dawson  Consulting  Engineers 
2226  Sixth  Avenue  N.  W. 

Seattle.  WA  98107 


ABSTFIACT:  The  measured  hook  current  of  a  large  cargo  crane  near  a  50-kilowatt  medium  wave 
antenna  system  is  compared  to  the  predicted  current  by  the  MINI  NEC  PROFESSIONAL.  Good 
correlation  was  obtained. 


INTRODUCTION:  Large  cargo  container  cranes  (Figure  1)  are  frequently  located  near  AM  radio 
stations.  Longshoremen  and  other  crane  workers  can  be  exposed  to  RF  shocks  or  burns  when 
they  come  into  contact  with  a  crane  hook  as  shown  in  Fig.  2.  This  office  was  recently  involved  in 
the  measurement  of  crane  hook  human  contact  current  at  a  port  terminal  located  adjacent  to  a 
standard  broadcast  AM  station  that  operates  with  50  KW  antenna  input  power.  The  effectiveness 
of  crane  hook  insulators  at  reducing  the  human  exposure  to  RF  crane  hook  currents  was  also 
measured  and  computations  of  crane  hook  insulator  effectiveness  were  made  using  a  beta  test 
version  of  the  MININEC  Broadcast  Professional  program. 

RF  CRANE  HOOK  CURRENT  MEASUREMENTS:  The  hook  current  was  measured  using  an  ANSI 
average  human  equivalent  impedance  (360  -j30  ohms  at  1000  kHz).  The  ANSI  average  human 
impedance  is  the  impedance  of  the  current  path  to  ground  that  is  created  when  a  person  contacts 
the  crane  hook.  The  average  human  impedance  is  shown  as  a  function  of  frequency  in  the 
appendix  to  lEEE/ANSI  C95.1-1992,  “Standard  for  Safety  Levels  with  Respect  to  Human 
Exposure  to  Radio  Frequency  Electromagnetic  Fields,  3  kHz  to  300  Ghz”.  The  contact  current 
measurement  device  is  called  the  "ANSI  MAN".  The  "ANSI  MAN”  simulates  the  electrical 
characteristics  of  a  human  in  the  circuit.  It  consists  of  a  plastic  tube,  5-feet  tall  with  an  impedance 
of  360  -j30  ohms  and  a  radio  frequency  ammeter.  The  "ANSI  MAN"  was  connected  between  the 
crane  hook  and  ground.  The  current  was  measured  as  shown  in  Fig.  3. 
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THE  MININEC  CRANE  MODEL;  The  cargo  crane  dimensions  as  estimated  from  a  photograph 
have  a  base  that  is  about  18  meters  in  height  and  12  meters  square.  On  top  of  that  is  an  arm 
about  46  meters  long  protruding  upward  at  an  angle  of  about  60  degrees.  The  crane  was  modeled 
as  two  cylinders  and  a  wire  to  the  hook  at  ground  level  as  shown  on  Figure  4. 

The  current  measured  through  the  "ANSI  MAN"  was  0.9  amperes.  Since  the  original  MININEC 
model  dimensions  were  estimated  from  a  photograph  of  the  crane  and  the  computed  current  was 
1.3  amperes,  the  height  of  the  crane  was  adjusted  from  57-meters  to  50-meters  so  that  the 
computed  contact  current,  at  0.96  amperes,  was  close  to  the  measured  current.  The  MININEC 
model  ignored  stacks  of  containers,  light  poles  and  other  nearby  equipment  associated  with  an 
active  port.  Also,  the  crane  was  modeled  as  two  straight  cylinders  since  the  detailed  structural 
components  are  small  compared  to  a  wavelength. 

COMPUTATIONS  OF  CRANE  HOOK  INSULATOR  EFFECTIVENESS:  The  MININEC  Broadcast 
Professional  computer  program  was  used  to  predict  the  effectiveness  of  a  crane  hook  insulator  that 
was  installed  to  reduce  the  human  contact  currents  caused  by  the  nearby  AM  radio  station. 
Contact  current  is  inversely  proportional  to  the  total  impedance  seen  by  the  crane  Thevenin 
equivalent  circuit  voltage  source.  The  total  impedance  seen  by  the  crane  voltage  source  when  the 
hook  is  insulated  is  the  sum  of  the  crane  internal  impedance,  the  impedance  of  the  hook  insulator 
and  the  ANSI  average  human  equivalent  impedance  (360  -j30  ohms  at  1000  kHz). 

The  total  impedance  is  strongly  affected  by  the  summation  of  the  reactive  component  of  the  crane 
internal  impedance  and  the  hook  insulator  reactance.  If  the  hook  insulator  reactance  is  opposite 
in  sign  to  the  reactance  of  the  crane  internal  impedance  it  is  possible  for  the  crane  insulator  to  have 
the  effect  of  increasing  the  crane  hook  contact  current.  The  crane  hook  contact  current  will  be 
increased  if  the  hook  insulator  capacitive  reactance  is  less  than  twice  the  magnitude  of  the 
inductive  reactance  of  the  crane  internal  impedance.  Hook  insulators  designed  for  use  at  60  Hz 
power  line  frequencies  have  low  capacitive  reactance  at  higher  frequencies  and  may  exacerbate 
crane  hook  RF  contact  current  problems. 

To  determine  the  effectiveness  of  the  hook  insulator  it  is  necessary  to  compute  the  Thevenin 
equivalent  circuit  impedance  of  the  crane  as  a  receiving  antenna,  Figure  5.  The  open  circuit 
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voltage  and  short  circuit  current  can  be  easily  computed  from  the  radio  station  tower  and  crane 
model.  The  open  circuit  voltage  is  computed  from  the  current  through  a  1  megohm  resistance  load 
from  the  hook  to  ground.  The  short  circuit  current  is  just  the  current  in  the  grounded  hook.  Since 
the  program  provides  the  current  as  a  polar  number  the  ratio  of  the  open  circuit  voltage  to  the  short 
circuit  current  yields  the  complex  internal  impedance  of  the  crane.  If  the  crane  source  voltage  is 
independent  of  the  load  on  the  hook  the  reduction  in  crane  hook  contact  current  is  given  by  the 
ratio  of  the  total  impedance  seen  by  the  crane  source  voltage  without  the  hook  insulator  to  the 
impedance  seen  by  the  crane  source  voltage  with  the  hook  insulator. 


The  short  circuit  current  at  the  hook  of  the  crane  model  was  1.06  amps  at  an  angle  of  -160 
degrees  while  the  open  circuit  current  through  1  megohm  was  3.3  milliamps  at  an  angle  of  -132 
degrees.  The  resulting  internal  impedance  is  3090  ohms  at  28  degrees.  The  total  impedance  with 
the  human  equivalent  impedance  added  is  3402  ohms.  When  a  22  pF  insulator  reactance  and  the 
ANSI  equivalent  human  impedance  are  added  to  the  internal  impedance,  the  total  load  impedance 
seen  by  the  source  voltage  is  6938  ohms.  The  ratio  of  these  two  impedances  gives  the  expected 
reduction  in  contact  current  provided  by  the  hook  insulator  to  49  %  of  the  uninsulated  value.  The 
uninsulated  contact  current  was  measured  Jan  1996  on  the  crane  for  which  a  MININEC  model  was 
constructed.  The  uninsulated  crane  hook  current  to  ground  was  600  mA.  When  the  hook  insulator 
was  installed  the  measured  contact  current  was  300  mA.  The  measured  value  of  contact  current 
with  the  crane  hook  insulator  installed  is  50%  of  the  uninsulated  value. 


CONCLUSION;  Very  useful  current  and  voltage  predictions  can  be  made  using  a  relatively  simple 
model  in  MININEC.  At  higher  frequencies  more  crane  detail  will  be  required.  The  impedance  of 
both  the  hook  insulator  and  the  crane  must  be  known  before  an  insulator  is  used.  It  is  possible  for 
a  hook  insulator  to  have  the  effect  of  increasing  the  crane  hook  RF  contact  current  when  a  crane 
has  a  high  Inductive  reactance. 

The  authors  thank  Steven  S.  Lockwood  for  assistance  in  providing  the  test  data  for  this  paper 
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FIGURE  4 

MININEC  CRANE  MODEL 
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CRANE  THEVENIN  EQUIVALENT  CIRCUIT  SOURCE  VOLTAGE 

CRANE  INTERNAL  IMPEDANCE 

IMPEDANCE  OF  HOOK  INSULATOR 

EQIVALENT  IMPEDANCE  ANSI  AVERAGE  PERSON 

TOTAL  IMPEDANCE  SEEN  BY  CRANE  SOURCE  VOLTAGE 
WITH  HOOK  INSULATED 

TOTAL  IMPEDANCE  SEEN  BY  CRANE  SOURCE  VDLTGAGE 
WITH  HOOK  UNINSULATED 

INSULATED  CRANE  HOOK  CONTACT  CURRENT 
UNINSULATED  CRANE  HOOK  CONTACT  CURRENT 


FIGURE  5 

CRANE  EQUIVALENT  CIRCUIT 
AND  DEFINITION  OF  TERMS 
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Xpatchf  Software  System  Analysis  and  Profiling^ 


B.  A.  Kadrovach,  T  .S.  Wailes,  A.  J.  Terzuoli,  Jr.,  D.  S.  Gelosh 
Air  Force  Institute  of  Technology 


1.  Introduction 

Xpatchf  is  a  powerful  radar  scattering  prediction  software  package  with  proven  accuracy  [1]. 
However,  Xpatchf  requires  a  significant  amount  of  time  to  generate  a  single  prediction  for  an  object  of 
moderate  size.  Therefore,  studies  are  being  conducted  to  improve  the  performance  of  Xpatchf  though 
hardware/software  codesigns  or  implementations  on  parallel  computers.  This  paper  details  the 
functionality  of  Xpatchf  and  describes  the  methods  used  to  identify  and  isolate  the  portions  of  Xpatchf 
that  are  best  suited  for  implementation  in  a  parallel  environment. 

2.  Xpatchf  Functionality 

The  source  code  of  Xpatchf  consists  of  a  mixture  of  FORTRAN  and  C.  The  main  functionality 
of  Xpatchf,  as  it  pertains  to  the  calculation  of  the  scattered  field,  is  written  in  FORTRAN.  C  is  used  to 
provide  dynamic  memory  management  and  ray  tracing  capabilities. 

The  only  abstract  data  type  used  by  the  Xpatchf  FORTRAN  code  is  the  array  -specifically,  a 
fixed-length  array.  Previous  versions  of  Xpatch  required  the  user  to  specify  the  sizes  of  the  various 
arrays  used  before  compilation.  However,  the  incorporation  of  key  C  language  routines  allows  the 
combined  C  and  FORTRAN  program  to  make  use  of  dynamic  length  arrays. 

Object  input  data,  such  as  the  geometry,  are  initially  read  from  an  input  file.  Data  of  similar 
nature  are  then  stored  in  contiguous  memory  blocks.  For  instance,  the  x,  y,  and  z  nodes  of  the  facet 
geometry  are  stored  as  3-tuples  in  an  array.  The  length  is  given  in  the  input  file  and  is  used  to  specify  a 
sufficiently  large  contiguous  block  of  memory  in  which  to  store  the  node  data.  These  arrays  are 
managed  and  maintained  by  the  C  functions.  FORTRAN  subroutines  that  require  access  to  these  data 
elements  as  input  parameters  are  called  by  other  FORTRAN  subroutines  through  a  C  function  call. 
Figure  1  shows  the  relationship  between  the  FORTRAN  subroutines  and  the  C  function  wrapper. 


^This  work  was  supported  by  the  U.  S.  Air  Force  Wright  Laboratories. 


In  calling  Fortran  subroutine: 

cJoo(a,  b,  c) 


In  C  function: 

void  c_foo(AType  a,  BType  b,  CType  c) 

{ 

too  (a.  b,  c,  a_array,  b_array); 

}: 


In  called  Fortran  subroutine: 

foo(a,  b,  c,  a_array,  b_array) 
dimension  a_array(a),  b_array(b) 

Figure  1.  FORTRAN/C  Interface 

The  dimension  of  each  array  may  be  passed  as  a  parameter  or  may  exist  as  a  global  variable.  Global 
variables  are  used  extensively  through-out  the  FORTRAN  code  of  Xpatchf. 

The  execution  of  Xpatchf  occurs  in  three  phases.  The  preprocessing  phase  establishes  initial 
configuration  parameters  and  reads  the  geometry  file  into  memory.  The  second  phase  consists  of  the 
single  bounce  calculation,  and  the  final  phase  consists  of  the  multiple  bounce  calculations.  Each  of  these 
phases  are  described  in  more  detail  in  the  following  sections. 

2.1  Preprocessing 

In  the  preprocessing  phase,  the  configuration  parameters  are  read  in  from  the  input  file.  Since  a 
facet  model  is  assumed  for  the  object,  there  are  two  possible  methods  for  calculating  the  single 
bounce.  [2]  The  Physical  Optics  (PO)  method  is  used  for  smooth  objects  with  little  blockage  while  the 
Z-buffer  method  is  used  for  complicated  objects.  Since  realistic  objects  are  unlikely  to  be  simple  and 
smooth,  it  was  decided  to  assume  that  the  Z-buffer  method  would  be  used  to  calculate  the  single 
bounce.  This  method  will  be  discussed  in  greater  detail  in  the  next  section. 

Preprocessing  continues  with  the  building  of  the  Binary  Space  Partition  (BSP)  Tree.  The  BSP 
Tree  greatly  speeds  the  ray-tracing  process  for  the  multiple  bounce  calculations.  For  complicated 
objects,  the  amount  of  run-time  devoted  to  building  the  BSP  Tree  can  be  significant.  In  the  case  of  the 
VI  aircraft  test  object  [2],  the  time  required  to  build  the  BSP  Tree  was  approximately  13%  of  the  total 
run-time. 

The  BSP  Tree  is  used  to  sort  the  facets  of  an  object.  This  is  accomplished  first  by  calculating  a 
bounding  box  for  the  object  in  three  dimensions.  Then,  the  bounding  box  is  evenly  divided  along  the  x- 
axis  to  form  two  child  boxes  also  known  as  voxels  (volume  elements).  In  general,  any  voxel  that  is 
evenly  divided  becomes  a  parent  to  the  two  children.  At  the  topmost  level,  the  facets  of  the  object  are 
sorted  into  the  child  voxel  that  either  contains  all  or  part  of  the  facet.  Each  child  voxel  becomes  a 
parent  to  two  more  child  voxels.  At  the  second  level,  the  voxels  are  divided  along  the  y-axis.  The  facets 
of  the  parent  voxels  are  then  sorted  into  their  respective  children  voxels.  At  the  third  level,  the  process 
is  repeated  except  that  the  division  of  the  parent  voxels  is  made  along  the  z-axis.  This  divide  and 
conquer  process  is  continued  by  repeating  the  voxel  division  along  the  successive  axes  (x  then  y  then  z) 
until  a  stopping  condition  is  met.  Ihe  user  specifies  the  maximum  number  of  facets  in  the  leaf  voxels. 
This  is  the  stopping  condition.  Only  the  leaf  voxels  contain  the  list  of  facets.  AU  non-leaf  voxels  contain 
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a  null  lists  of  facets.  This  is  critical  to  the  traversal  of  the  BSP  Tree  when  searching  for  a  facet.  The  use 
of  the  BSP  Tree  is  a  trade-off  between  speed  and  memory  requirements.  If  the  maximum  number  of 
facets  allowed  in  a  leaf  voxel  is  small,  the  ray-tracing  (to  be  described  in  detail  later)  will  proceed  at  a 
much  faster  rate.  However,  the  memory  necessary  to  store  the  BSP  Tree  increases  significantly. 

2.2  Single  Bounce  Calculations 

The  single  bounce  calculation  may  be  accomplished  using  either  the  Physical  Optics  (PO)  or  Z- 
buffer  methods.  The  Z-buffer  technique  was  chosen  because  of  its  ability  to  handle  complicated  objects. 
The  Z-buffer  method  is  a  computer  graphics  algorithm  that  finds  all  the  visible  surfaces  from  a 
particular  viewing  angle.  In  the  case  of  Xpatchf,  the  viewing  angle  is  the  angle  of  the  incident  wave. 

Figure  2  shows  the  way  in  which  the  Z-buffer  technique  is  used.  A  viewing  screen  is  created  and 


The  buffer  is  filled  with  the  facet  information  of  the  closest  facet.  Once  the  buffer  is  built,  the  facet 
information  is  used  to  calculate  the  single  bounce. 

2.3  Multiple  Bounce  Calculations 

The  second  and  higher  bounces  of  the  incident  wave  are  calculated  using  the  Shooting- 
Bouncing  Ray  (SBR)  technique.  The  SBR  method  is  used  for  all  of  the  object  geometries  available  to 
the  Xpatchf  user. 

The  SBR  method  is  a  ray-tracing  technique  usually  used  in  computer  graphics  applications. 
Rays  are  fired  at  the  object  from  a  virtual  screen  somewhat  similar  to  the  Z-buffer  screen.  But  rather 
than  building  a  buffer,  each  ray  is  followed  as  it  bounces  within  the  object  until  it  either  exits  the  object 
or  bounces  a  maximum  number  of  times  (specified  by  the  user).  In  normal  operation  of  Xpatchf,  only 
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the  1st  and  last  bounces  of  the  incident  wave  are  used.  Since  the  single  bounce  is  found  by  other  means, 
only  the  last  bounce  of  the  SBR  is  of  interest.  When  a  ray  is  detected  as  leaving  the  object,  its 
contribution  is  calculated  and  the  next  ray  is  then  followed.  The  number  of  rays  fired  at  an  object 
depend  on  its  size  and  the  frequencies  being  simulated.  As  an  example,  if  the  boxfin  [2]  is  to  be 
calculated  for  a  frequency  of  1  IGHz,  approximately  1  million  rays  will  be  fired. 

Of  the  three  main  phases  of  Xpatchf  operation,  the  last  phase  offers  the  greatest  potential  for 
optimization  in  some  form  of  a  concurrent  architecture.  However,  all  three  phases  could  benefit  from 
the  advantages  offered  by  concurrent  operation.  This  will  be  discussed  in  the  next  section. 

3.  Identified  Potential  Concurrencies 

Each  of  the  three  phases  (preprocessing,  single  bounce,  and  SBR)  have  potential  for 
implementation  in  a  concurrent  architecture  since  each  phase  consists  of  one  or  more  loops.  The  largest 
percentage  of  computational  effort  in  the  preprocessing  phase  is  devoted  to  building  the  BSP  Tree. 
Since  the  tree  is  a  dynamic  memory  structure,  it  does  not  lend  itself  well  to  optimization  in  a  parallel 
execution.  The  remaining  two  phases  deserve  a  closer  look.  First,  an  analysis  of  the  Z-buffer 
implementation  will  be  described.  Then,  an  in-depth  investigation  of  the  SBR  method  will  be  presented. 

3.1  Z-Buffer 

The  program  flow  of  the  Z-buffer  source  code  is  best  described  with  an  illustration.  Figure  3 
shows  a  graphical  representation  of  the  Z-buffer  method  as  implemented  in  Xpatchf. 


Figure  3.  Fill-Buffer  Loop 

The  screen  is  divided  into  sections  with  each  section  no  larger  than  600  x  600  pixels.  The  size  of  a  pixel 
is  specified  by  the  user  and  is  relative  to  the  wavelength  being  simulated.  The  relationship  between  the 

number  of  total  pixels,  ipixxtot,  and  the  size  of  the  object  is  given  by  ipixxtot  - - - - .  This 

pixsize  -X 

equation  is  shown  for  the  x  axis  but  the  same  applies  for  the  y  axis.  In  the  above  equation,  pixsize  is 
specified  by  the  user  and  is  relative  to  the  wavelength.  The  wavelength,  X,  is  the  smallest  wavelength 
being  considered  in  the  calculations.  The  size  of  the  object,  Ax,  is  the  size  of  the  projection  of  the  object 
onto  the  Z-buffer  screen.  For  instance,  if  the  largest  frequency  being  considered  is  12.0GHz  (X  = 
0.025m)  and  the  user  specifies  the  pixel  size  to  be  O.R,  the  resulting  total  number  of  x  pixels  is  2,000 
for  an  object  with  a  size  of  5  meters. 
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Once  the  number  of  pixels  is  determined  and  the  number  of  zones  necessary  to  ensure  that  the 
size  does  not  exceed  the  600  x  600  pixel  requirement,  the  Z-buffer  is  filled  with  object  data. 
Specifically,  for  each  zone,  all  facets  of  the  object  are  examined  to  find  their  place  in  the  Z-buffer.  If 
facets  overlap,  the  closest  facet  is  used  to  fill  the  Z-buffer  with  the  appropriate  data.  This  data  includes 
the  facet  identification  (a  unique  integer  is  assigned  to  each  facet),  the  facet  material,  and  the  facet 
normal.  Finally,  with  the  Z-buffer  filled,  Xpatchf  calculates  the  far  field. 

3,2  SBR 

The  Shooting  Bouncing  Ray  (SBR)  method,  used  to  calculate  the  multiple  bounce 
contributions,  is  somewhat  similar  to  the  Z-buffer  manipulation  with  the  exception  of  the  multiple 
bounces  of  the  ray.  As  before,  a  virtual  screen  is  used  to  fire  rays  at  the  object.  TTie  dimensions  of  this 
screen  are  specified  by  size  of  the  object  projection  onto  the  virtual  screen.  The  axes  of  the  screen  are 
elevation  and  azimuth  (rather  than  x  and  y).  Once  the  extents  of  the  screen  are  calculated,  a  grid  is 
established  to  fire  the  rays  at  the  object.  The  number  of  rays  to  be  fired  are  determined  from  a 
combination  of  object  dependent  parameters  and  user  specified  parameters  [2]. 

In  order  to  speed  the  ray  tracing,  the  virtual  screen  can  be  divided  into  user  defined  zones.  A 
typical  zone  size  is  a  square  of  20  x  20  rays.  A  test  ray  is  fired  into  the  center  of  the  zone.  If  the  ray 
intersects  the  object,  all  the  rays  in  the  zone  are  fired,  otherwise  the  zone  is  skipped  saving  400  ray 
computations. 

When  a  ray  is  found  that  intersects  the  object,  it  is  followed  for  a  specified  maximum  number  of 
bounces.  The  default  maximum  is  50,  however  the  user  may  specify  any  value  between  1  and  50.  If  the 
ray  does  not  exit  the  object  before  this  maximum  value  is  reached,  the  ray  is  discarded  and  the  next  ray 
is  traced.  It  is  important  to  note  that  even  though  the  first  bounce  is  calculated  using  a  different  method, 
the  SBR  ray  tracing  routines  must  still  fire  each  ray  and  calculate  the  first  bounce  information.  Figure  4 
illustrates  the  program  flow  for  the  SBR  source  code. 


The  SBR  routine  contains  numerous  conditional  exits  from  each  of  the  loops.  This  complicates  the 
implementation  of  this  code  into  a  VHDL  model  since  application  specific  hardware  does  not  easily 
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handle  conditional  data  flow.  These  conditional  branches  require  a  flexible  model  able  to  manage  the 
changes  in  data  flow. 

The  functionality  of  Xpatchf  was  determined  by  profiling  the  software  in  the  run-time 
environment.  This  was  done  by  tracking  function  and  subroutine  calls  in  both  the  FORTRAN  and  C 
code.  Since  most  of  the  C  code  functions  exist  as  library  functions,  the  blending  of  FORTRAN 
subroutine  calls  with  the  C  code  function  calls  presented  a  major  difficulty.  There  was  no  easy  way  to 
determine  the  temporal  relationship  between  the  FORTRAN  and  C  source  code.  The  only  alternative 
was  a  line-by-line  analysis  of  the  source  code  to  determine  the  subroutine/function  calling  hierarchy. 

In  order  to  profile  the  FORTRAN  source  code,  each  of  its  subroutines  was  changed  so  that  a 
unique  flag  was  written  to  a  file  every  time  that  the  subroutine  was  called.  Then,  special  support 
programs  were  written  to  manage  and  analyze  the  vast  amounts  of  information  produced.  A  table  was 
generated  to  show  the  statistics  for  each  subroutine.  This  data  was  used  to  focus  the  analysis  effort 
toward  those  subroutines  used  most  frequently.  Next,  a  time  plot  was  generated  to  show  the  temporal 
relationship  of  each  subroutine.  These  plots  showed  the  periodicity  of  the  Xpatchf  run-time 
environment.  Since  these  plots  consist  of  several  thousand  lines  of  text  data,  none  are  included  with  this 
paper. 

The  same  method  was  applied  to  the  C  functions.  Some  of  the  C  functions  simply  provided  the 
necessary  dynamic  memory  management  as  explained  earlier.  The  C  functions  of  interest  performed  the 
management  of  the  BSP  Tree  and  the  actual  ray  tracing.  Once  the  source  code  was  analyzed  and  the 
hierarchy  was  understood,  the  temporal  relationship  between  the  FORTRAN  and  C  functions  could  be 
determined.  These  temporal  relationships  were  used  to  identify  the  Xpatchf  functionality  that  would 
achieve  the  most  benefit  from  a  hardware  implementation.  The  results  of  that  analysis  are  shown  in 
Table  1. 


Table  1.  Run-time  Percentages 


Object 

BSP  Generation 

Single  Bounce 

Multi-Bounce 

Boxfin 

0.10% 

11.16% 

87.97% 

Turkey 

0.63% 

2.94% 

95.56% 

VI  Aircraft 

12.85% 

42.37% 

42.17% 

The  times  of  execution  are  shown  as  percentages  of  the  total  time  of  execution.  This  provides  a 
normalized  view  of  the  relative  time  spent  in  each  phase.  It  should  be  noted  that  not  aU  of  the  run-time 
is  accounted  for  in  the  above  table.  The  pre-  and  post-processing  is  not  shown  and  therefore,  the 
percentages  of  each  row  do  not  add  up  to  100%,  The  first  phase  (BSP  generation)  was  small  for  the 
first  two  objects  but  became  significant  for  the  VI  aircraft  due  to  its  complexity.  The  other  two  phases 
(single  bounce  and  multi-bounce  calculations)  showed  wide  variations.  The  run-time  for  boxfin  and 
turkey  indicated  that  most  of  the  time  was  spent  in  the  multi-bounce  portion  of  the  code  while  the  run¬ 
time  for  the  VI  aircraft  showed  about  equal  proportions  of  time  spent  in  the  single  bounce  and  multi¬ 
bounce  calculations.  It  is  hypothesized  that  the  processing  of  the  VI  aircraft  could  be  improved  by 
eliminating  the  single  bounce  calculation  by  Z-buffer  (or  PO)  and  stay  strictly  with  SBR.  The  Single 
Bounce  and  Multi-Bounce  calculations  would  most  benefit  from  a  parallel  implementation  since  they 
require  the  greatest  portion  of  the  run-time. 
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4.  Conclusion 

This  paper  presented  a  method  to  analyze  and  profile  the  Xpatchf  software  system  for  a  parallel 
implementation.  The  approach  was  to  identify,  through  run-time  analysis,  the  portions  of  the  program 
that  repeat.  Data  interdependencies  between  repeated  cycles  of  code  must  be  identified  to  ensure  that 
one  cycle  may  be  executed  independently  of  another.  In  Xpatchf  those  cycles  exist  in  both  the  Single 
Bounce  and  Multi-Bounce  calculations. 
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1.  INTRODUCTION 

Photonic  band  gap  (PBG)  materials  are  periodic  dielectric  structures  exhibiting  a  frequency 
gap  in  which  all  modes  are  forbidden,  analogous  to  the  electronic  band  gap  in  the  case  of 
semiconductors.^"^  In  such  materials,  spontaneous  emission  is  suppressed  for  photons  with  fre¬ 
quencies  in  the  forbidden  region.  Already,  there  have  been  a  number  of  proposals  for  photonic 
band  gap  based  devices  with  operating  frequencies  ranging  from  microwave  to  optical,  including 
zero-threshold  lasers,  novel  resonators  and  cavities,  and  efficient  microwave  antennas. Most  of 
these  applications  rely  on  the  availability  of  structures  with  full  three-dimensional  photonic  band 
gaps. 

In  the  present  paper,  we  are  using  the  transfer  matrix  method^  in  order  to  calculate  the 
transmission  and  reflection  properties  of  PBG  materials.  In  particular,  we  study  defects  in  periodic 
PBG  materials  which  can  be  used  as  filters  and  high  Q  cavities.  We  also  study  PBG  materials  as 
a  substrate  for  planar  dipole  antennas.  Since  PBG  materials  are  perfectly  reflecting  materials  for 
frequencies  inside  the  photonic  band  gap,  one  expects  that  they  will  impove  the  performance  of 
antennas. 

According  to  the  transfer-matrix  method  (TMM),  the  space  is  divided  in  small  cells  and  the 
fields  in  each  cell  are  coupled  with  those  in  the  neighboring  cells;  then  the  transfer  matrix  can  be 
defined  by  relating  the  fields  on  one  side  of  the  material’s  slab  with  those  on  the  other  side.  So, 
the  transmission  and  reflection  coefficients  for  EM  waves  of  various  frequencies  incident  on  a  finite 
thickness  material  can  be  obtained;  the  system  assumed  periodic  in  the  directions  parallel  to  the 
interfaces.  The  method  has  been  used  in  two-dimensionaP  and  three-dimensionaP  structures  as 
well  as  in  cases  where  the  dielectric  constant  is  complex  and  frequency  dependent.^’®  In  all  the 
cases,  the  agreement  between  theory  and  experiment  is  very  good. 

2.  FILTERS 

We  use  a  layer-by-layer  PBG  crystal  which  has  been  designed  and  fabricated  by  the  Iowa 
State  group. The  structure  is  assembled  by  stacking  layers  consisting  of  parallel  rods  with  a 
center-to-center  separation  of  a.  The  rods  are  rotated  by  90®  in  each  successive  layer.  Starting  at 
any  reference  layer,  the  rods  of  every  second  neighboring  layer  are  parallel  to  the  reference  layer, 
but  shifted  by  a  distance  0.5a  perpendicular  to  the  rod  axes.  This  results  in  a  stacking  sequence 
that  repeats  every  four  layers.  This  lattice  has  face-centered-tetragonal  (fct)  lattice  symmetry 
with  a  basis  of  two  rods.  The  photonic  band  gap  is  not  sensitive  to  the  cross-section  shape  of 


436 


Frequency  (GHz) 


Figure  1:  The  transmission  characteristics  of  EM  waves  propagating  in  a  perfect  (a)  and  a 
periodic  with  defect  (b)  layer-by-layer  structure  with  k  perpendicular  to  the  plane  of  the  rods 
(x-y  plane).  The  systems  consists  of  16  layers  of  rods.  The  defect  is  created  by  removing  every 
other  rod  from  the  8th  layer. 

the  rods.  This  structure  has  a  photonic  band  gap  when  both  the  filling  ratio  and  the  dielectric 
contrast  meet  certain  requirements.®"^^ 

In  the  present  case,  we  use  circular  rods  with  dielectric  constant  9-61  and  0.318  cm  in  diameter; 
a=1.123  cm  and  the  filling  ratio  of  around  0.26.  We  define  the  plane  of  each  layer  as  the  x-y  plane. 
Using  the  TMM,  we  calculate  the  transmission  (Fig.  la)  of  a  structure  with  four  conventional 
unit  cells  along  the  z-axis  (16  stacked  layers).  We  use  one  unit  cell  along  the  x-  and  y-axis  with 
periodic  boundary  conditions  (the  system  is  infinite  along  the  x-  and  y-axes).  Each  conventional 
unit  cell  is  divided  into  7x7x8  cells  in  order  to  ensure  good  convergence.  For  both  polarizations, 
there  is  a  gap  between  11.2  and  16.9  GHz.  This  is  in  very  good  agreement  with  the  experimental 
measurments  as  well  as  with  the  results  of  the  plane  wave  (PW)  method®’^®  There  are  differences 
between  the  two  polarizations,  especially  at  high  frequencies.  It  is  worth  noting  that  the  difference 
between  the  two  polarizations  is  due  to  the  surface  of  the  system  (the  rods  on  the  surface  are 
parallel  to  the  z-axis).  Band  structure  calculations  of  a  corresponding  infinite  system  do  not  show 
any  difference  between  the  two  polarizations.  The  transmission  at  the  middle  of  the  gap  is  around 
-80  dB.  Calculations  for  thinner  structures  show  the  drop  of  the  transmission  is  about  -19  dB  per 
unit  cell;  by  using  just  two  unit  cells  (8  layers  of  rods),  we  can  have  a  PBG  with  a  transmission 
of  about  -38  dB  which  is  enough  for  most  of  the  possible  applications. 

Defects  are  created  by  removing  or  adding  rods  into  the  structure.  We  first  investigate  the 
properties  of  defect  structure  made  by  removing  every  other  rod  from  the  8th  layer.  The  system 
is  otherwise  similar  with  the  one  described  previously.  A  peak  in  the  transmission  appears  at 
12.58  GHz.  The  peak  is  sharper  for  the  polarization  where  E  is  parallel  to  the  axes  of  the  removed 
rods  ( in  Fig.  lb);  in  particular,  Q=800  and  the  transmission  at  the  peak  is  -4  dB  for  the 
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Figure  2:  Transmission  and  absorption  for  EM  waves  propagating  in  a  3D  s.c.  lattice  consisting 
of  metallic  tetragonal  rods  connecting  nearest  neighbors  with  f=0.03,  L=  3a  and  8=  0°.  A  supercell 
of  width  2a  has  been  used  with  periodic  boundary  conditions  at  the  edges  of  the  supercell.  Part 
of  the  metal,  which  is  included  in  a  sphere  with  a  center  in  one  of  the  crossing  points  of  the  rods 
at  the  second  layer  and  radius  rj-  has  been  removed.  Solid,  dotted,  and  dashed  lines  correspond 
to  rd/a=  0,  0.15  and  0.5,  respectively. 


wave  with  E//x  while  Q=900  and  the  transmission  at  the  peak  is  -29.7  dB  for  the  wave  with 
E//y.  The  defect  frequency  and  Q-factor  are  not  very  sensitive  to  separation  between  the  defects. 
In  particular,  the  defect  frequency  changes  from  12.58  to  12.68  to  12.39  GHz  as  the  separation 
between  the  removed  rods  increases  from  2a  to  3a  to  4a.  Also,  the  Q-factor  increases  rapidly  by 
increasing  the  thickness  of  the  PBG  crystal.  For  a  defect  structure  with  8  unit  cells  (32  layers) 
made  by  removing  every  other  rod  from  the  16th  layer,  calculations  predict  a  defect  frequency  at 
12.61  GHz  with  a  Q-factor  more  than  1.4  x  10®.  This  suggests  the  possibility  of  building  cavity 
structures  with  very  high  Q-factors  from  this  geometry.  We  can  also  adjust  the  defect  frequency 
by  changing  the  radius  of  the  rods.  In  a  4-unit  cell  system  (16  layers),  we  assume  that  every  other 
rod  in  the  8th  layer  has  radius  instead  of  r.  A  defect  peak  inside  the  gap  appears  when  r^/r 
is  around  0.65.  By  decreasing  rj/r,  the  defects  frequency  increases  and  for  rd/r=0  (this  is  the 
case  where  every  other  rod  has  been  removed;  see  Fig.  lb)  the  defect  frequency  is  12.58  GHz  as 
discussed  previously. 

We  have  also  studied  photonic  band  gap  materials  constructed  by  metallic  components.  We 
use  a  s.c.  lattice  consisting  of  metallic  tetragonal  rods  connecting  nearest  neighbors  (network 
topology)  with  filling  ratio,  f=  0.03,  lattice  constant,  a=1.27  yum,  and  the  surrounding  medium  is 
air.  The  periodic  system  (solid  lines  in  Fig.  2)  has  a  band  gap  from  zero  up  to  a  cut-off  frequency 
of  100  THz.  This  band  gap  appears  for  any  angle  and  polarization.  We  also  study  the  effect  of 
the  introduction  of  defects  in  a  periodic  lattice.  A  supercell  has  been  used  with  width  2a  along 
the  X-  and  y-axis  and  periodic  boundary  conditions  are  imposed  at  the  edges  of  the  supercell;  the 
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system  is  finite  along  the  z-axis  with  thickness  L=  3a.  A  defect  is  introduced  by  removing  part 
of  the  metal,  which  is  included  in  a  sphere  of  radius  Td  centered  at  one  of  the  crossing  points  of 
the  rods  in  the  second  layer.  Figure  2  shows  the  transmission  and  absorption  for  such  a  defect 
structure  and  incident  waves  with  k  parallel  to  the  z-axis  and  a=  1.27  /xm.  Once  again  the  results 
are  identical  for  both  polarizations  due  to  the  symmetry  of  the  structure.  For  r<i/a=0.15,  a  small 
peak  in  the  transmission  appears  at  around  31  THz;  the  quality  factor  is  very  small  (Q=  3)  and 
the  transmission  at  the  top  of  the  peak  is  also  small  (-37.2  dB).  Apart  from  the  frequency  region 
around  the  defect,  the  transmissions  of  the  defect  and  the  periodic  structures  are  almost  the  same 
(compare  dotted  and  solid  lines  in  Fig.  2).  For  rd/a=0.5  (dashed  line  in  Fig.  2),  there  is  a  peak  in 
the  transmission  at  higher  frequency  (60  THz)  which  is  even  wider  and  with  higher  transmission 
at  the  top  of  the  peak.  Thus,  one  can  adjust  the  frequency  of  the  defect  inside  the  gap  by  just 
changing  the  volume  of  the  removed  metal;  the  higher  the  amount  of  the  removed  metal,  the 
higher  the  frequency  where  the  defect  peak  appears.  Studies  in  dielectric  PBG  materials  have 
shown  that  a  defect  band  emerges  from  the  lower  edge  of  the  gap  and  approaches  the  upper  edge 
of  the  gap  as  the  volume  of  the  removed  dielectric  material  increases.  The  behavior  of  the  defect 
band  is  similar  in  the  present  case,  despite  the  fact  that  the  lower  edge  of  the  gap  is  actually 
at  zero  frequency.  The  absorption  for  r<f/a=0.15  is  almost  identical  with  the  absorption  of  the 
periodic  case  except  for  a  small  maximum  at  the  frequency  where  the  defect  peak  appears  (hardly 
noticed  by  comparing  dotted  and  solid  lines  in  Fig.  2).  However,  the  differences  in  the  absorption 
between  the  r^f/a^O.S  and  the  periodic  cases  (compare  dashed  and  solid  lines  in  Fig.  2)  are  more 
obvious  with  a  more  prominent  peak  of  the  absorption  in  the  7’d/a=0.5  case  at  the  frequency 
where  defect  appears  (60  THz).  Since  the  light  is  trapped  around  the  defect  region,  one  actually 
expects  the  absorption  will  be  higher  in  this  case. 

In  conclusion,  dielectric  and  metallic  PBG  materials  can  be  used  as  high  Q  cavities  and  filters 
by  creating  defects  in  those  structures.  The  frequency  where  the  transmission  peak  appears  can  be 
controlled  by  the  size  of  the  defect.  Dielectric  PBG  materials  exhibit  much  higher  values  of  Q  (of 
the  order  of  10^  or  more)  than  the  metallic  PBG  materials.  The  reason  for  that  difference  lies  in 
the  absorption  of  the  metals  and  it  becomes  more  prominent  as  the  frequency  increases. However, 
metallic  PBG  materials  may  be  more  attractive  in  microwave  and  millimeter  wave  aplications  and 
for  cases  where  the  size  and  weight  of  the  material  is  an  important  factor.  One  can  make  the 
metallic  PBG  filters  very  small  in  size  by  using  high  dielectric  material  as  a  background  material 
and  by  decreasing  the  volume  of  the  removed  metal  (rj  in  Fig.  2).^^ 

3.  PBG  AS  SUBSTRATES  FOR  ANTENNAS 

Conventional  integrated  circuit  antennas  on  a  semi-infinite  semiconductor  substrate  (with 
dielectric  constant  e)  have  the  drawback  that  the  power  radiated  into  the  substrate  is  a  factor  e  ^ 
larger  than  the  power  into  the  free  space. Hence,  antennas  on  GaAs  or  Si,  radiate  only  2%  of 
their  power  into  free  space.  Of  the  power  radiated  into  the  substrate,  a  large  fraction  is  in  the  form 
of  trapped  waves  propagating  at  angles  larger  than  the  critical  angle. By  fabricating  the  antenna 
on  a  PBG  material  with  a  driving  frequency  in  the  stop  band,  no  power  should  be  transmitted 
into  the  PBG  material  and  all  power  should  be  radiated  in  the  free  space  assuming  that  there  are 
no  evanescent  surface  modes.  Brown  et  al.^^  demonstrated  this  concept  by  fabricating  a  bow-tie 
antenna  on  their  3-cylinder  PBG  material  and  found  a  complex  radiation  pattern  confined  to  free 
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Figure  .3;  The  transmission  for  s-  and  p-polarized  waves  (a  and  b  panels,  respectively)  for  EM 
waves  propagating  in  a  multilayer  described  in  the  text.  Solid  and  dotted  lines  correspond  to 
incident  angles  of  0  and  60  degrees. 
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Figure  4:  The  radiation  pattern  of  a  dipole  placed  on  top  of  a  multilayer  with  driving  frequency 
of  8  and  6  GHz  (a  and  b  panels,  respectively).  Solid  and  dashed  lines  corespond  to  H-  and 
E- planes. 


space.  Subsequently,  they  improved  the  directionality  of  a  dipole  antenna  by  placing  it  on  high- 
and  low-dielectric  surfaces  of  their  PBG  materials.^®  More  recently,  Cheng  et  al.^’’  have  measured 
the  radiation  pattern  of  planar  dipole  antennas  in  the  12-15  GHz  regime  placed  on  a  layer-by-layer 
PBG  material. 

Theoretical  studies  of  radiation  patterns  of  antennas  with  PBG  substrates  are  needed  in  order 
to  compare  and  probably  improve  the  existing  antennas.  We  can  solve  this  problem  by  using 
the  reciprocity  relation  for  radiation  patterns.  According  to  reciprocity,  the  problem  where  a 
transmitting  antenna  is  placed  on  top  of  a  PBG  material  and  a  receiving  antenna  at  infinity,  is 
actually  equivalent  to  the  problem  where  a  transmitting  antenna  is  placed  at  infinity  and  the 
receiving  antenna  on  top  of  the  PBG  substrate.  The  second  problem  problem  can  be  easily 
solved  using  the  transfer  matrix  method.  Since  the  transmitting  antenna  is  at  infinity,  we  can 
approximate  the  radiated  waves  in  the  vicinity  of  the  surface  of  the  PBG  material  by  plane  waves. 
Using  the  TMM,  we  can  find  the  pattern  of  the  electric  field  on  the  surface  of  the  PBG  materials 
which  is  actually  the  radiation  pattern.  We  tested  this  method  for  a  uniform  dielectric  substrate 
and  the  results  are  in  very  good  agreement  with  numerical  results  reported  by  Cheng  et  al.  For 
the  results  of  Ref.  17,  the  far-field  radiation  of  the  dipole  placed  above  a  finite  dielectric  slab  was 
calculated  by  expanding  the  spherical  dipole  fields  in  plane  waves,  and  utilizing  Fresnel  refiection 
and  transmission  coefficients. 

As  a  next  step,  we  calculate  the  radiation  pattern  of  a  dipole  placed  on  top  of  a  periodic 
multilayer.  The  multilayer  is  constructed  by  two  different  layers  placing  them  one  after  the 
other.  We  call  the  high  and  the  low  dielectric  constants  layers  by  H  and  L,  respectively.  The 
dielectric  constants  of  the  layers  are  e/f  =  9.61  and  ei,  =  1  while  the  ratio  of  their  thicknesses  is 
dn/di  =  0.314.  The  multilayer  consists  of  13  layers  with  total  thickness  of  0.104  m.  Fig.  3  shows 
the  transmission  for  s-  and  p-polarized  waves.  The  width  of  the  gap  increases  as  the  incident 
angle  increases  for  s-polarized  waves,  while  it  tends  to  disappear  as  the  incident  angle  increases 
for  p-polarized  waves. 

Fig.  4  shows  the  radiation  pattern  of  an  electric  dipole  placed  on  top  of  the  multilayer  with 
driving  frequencies  8  and  6  GHz.  The  calculated  pattern  have  been  found  using  the  method 
described  previously.  The  patterns  are  symmetrical  relative  to  the  vertical  axis.  For  8  GHz,  the 
patterns  in  the  E-  and  H-plane  (dashed  and  solid  lines  in  Fig.  4a)  are  similar  for  almost  all  the 
angles  except  for  small  angles  where  a  peak  appears  in  the  E-plane  pattern.  For  6  GHz  (dashed 
and  solid  lines  in  Fig.  4b),  there  are  important  differences  for  the  E-  and  H-plane  patterns.  In 
particular,  the  E-plane  pattern  has  a  peak  at  30°  with  intensity  0.49  while  the  H-plane  pattern 
has  a  peak  at  45°  with  intensity  0.15. 

Preliminary  results  as  well  the  measurment  reported  in  Ref.  17  show  that  in  the  case  of 
the  layer-by-layer  PBG  the  pattern  is  strongly  depended  on  the  driving  frequency  as  well  as  the 
position  and  orientation  of  the  antenna.  Virtually  no  power  is  lost  to  the  PBG  material  resulting 
in  gains  and  radiation  efficiencies  larger  than  antennas  on  other  conventional  substrates. 
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Abstract 


Richardson  extrapolation  can  be  used  to  obtain  an  accurate  solution  to  a  problem 
■while  being  efficient  with  both  computer  time  and  memory.  In  this  paper,  Richardson 
extrapolation  is  used  in  conjunction  with  the  finite  element  method  to  solve  two- 
dimensional  electrostatics  problems.  Numerical  results  are  presented,  and  it  is  seen  that 
high  accuracy  can  be  achieved  when  Richardson  Extrapolation  is  used. 


Introduction 

Error  analysis  has  become  increasingly  important  in  the  computational  sciences,  has 
recently  received  abundant  attention  in  the  scientific  literature.  A  posteriori  error 
estimation  in  conjunction  with  the  first  order  finite  element  method  was  discussed  in  [1]  by 
Babuska,  Duran,  and  Rodriguez..  Also,  in  [2],  adaptive  finite  element  methods  have  been 
employed  by  Eriksson  and  Johnson  to  solve  parabolic  partial  differential  equations.  In  [3], 
Drago,  Molfino,  Nervi,  and  Repetto  discuss  error  estimation  in  the  finite  element 
determination  of  the  magnetic  field  in  a  two-dimensional  region  using  node-based 
elements.  And,  in  [4],  Hutchraft  and  Gordon  discuss  the  use  of  Richardson  extrapolation 
in  one-  and  two-dimensional  finite  difference  problems. 

One  distinct  advantage  of  the  use  of  error  analysis  is  that  it  makes  possible  the 
calculation  of  an  accurate  solution  without  extremely  long  compute  time  or  substantial 
computer  memory  capacity.  In  this  paper,  Richardson  extrapolation  is  used  to  obtain 
accurate  solutions  for  two-dimensional  electrostatics  problems  using  the  finite  element 
method. 

In  the  following,  the  implementation  of  Richardson  extrapolation  to  solve  electrostatics 
problems  will  be  discussed. 
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Technique 


Consider  the  finite  element  analysis  of  electrostatics  problems.  When  linear  basis  and 
testing  functions  are  used  in  the  finite  element  method,  it  can  be  shown  that  the  error  in 
the  finite  element  solution  will  vary  as  ,  where  h  is  the  mesh  spacing  [5],  With 
Richardson  extrapolation,  the  solution  is  calculated  with  two  different  values  of  mesh 
spacing,  and  then  a  more  accurate  solution  is  obtained  by  forming  a  linear  combination  of 
the  two  results.  With  the  correct  combination,  some  of  the  truncation  error  that  is 
introduced  into  the  solution  fi-om  the  choice  of  basis  and  testing  functions  can  be 
eliminated. 

The  solution  is  obtained  with  an  original  mesh  spacing,  h,  and  then  the  spacing  between 
nodes  is  halved  to  {h/2),  and  the  numerical  solution  is  calculated  again.  From  these 
solutions,  an  extrapolate  at  each  node  in  the  original  mesh  is  obtained.  For  example, 
consider  the  use  of  the  one-dimensional  meshes  shown  in  Figures  1  and  2. 


Figure  1.  Original  Mesh 


Figure  2.  Refined  Mesh  with  Halved  Spacing 

The  problem  is  solved  using  the  original  mesh  and  solved  again  using  the  refined  mesh. 
Then,  the  Richardson  extrapolate  solution  is  calculated  using  the  equation: 


3.0 


where  «^(x,7)is  the  extrapolated  solution,  w(x,jp)is  the  original  numerical  solution 
obtained  using  the  mesh  spacing  h,  and  {x,y)  is  the  numerical  solution  calculated 

h 

using  the  refined  mesh  with  mesh  spacing  — . 

Numerical  Results 

As  an  illustration  of  these  ideas  in  the  two-dimensional  case,  consider  the  finite  element 
solution  to  the  problem: 


-  V  •  {aVu)+bu  =  0  on  n 
u-han  dO. 
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For  simpUcity,  we  will  consider  the  problem  in  which  Q  is  the  square  region  from  x=0.0  to 
x=2.0  and  y=0.0  to  2.0  and: 

a(x,:>;)  =  1.0 

=  0.0 

w(0.0,y)  =  0.0 
«(2.0,>’)  =  0.0 
«(x,0.0)  =  0.0 
m(x,2.0)  =  sin(OT  /  2.0) 

The  problem  was  solved  with  meshes  having  3,  6,  9, 12,  and  15  nodes  in  both  the  x  and 
y  directions.  In  each  case,  the  distance  between  nodes  was  then  halved  and  the  problem 
was  solved  again.  The  Richardson  extrapolate  solution  was  then  calculated  using  the 
equation  shown  above. 

In  order  to  quantify  the  accuracy  of  the  solutions,  a  nodal  error  defined  by  the  equation 


nodal  error  = 


numerical  ^lanalytic  ^ 


was  used.  In  this  equation,  the  summation  is  over  all  nodes  at  which  the  solution  must  be 
calculated,  i.e.,  the  interior  nodes.  We  refer  to  this  as  the  nodal  error  because  it  is  a 
measure  of  the  error  of  the  solution  at  the  mesh  node  locations.  The  improved  accuracy 
of  the  Richardson  extrapolate  solution  can  be  seen  when  the  natural  logarithm  of  the  nodal 
error  is  plotted  against  the  logarithm  of  the  spacing  distance,  h.  This  is  shown  for  the 
two-dimensional  case  in  Figure  3.  Because  the  truncation  error  due  to  the  use  of  linear 
basis  functions  is  second  order,  the  error  in  the  original  finite  element  solution  of  the 
problem  varies  as  }?.  This  can  be  seen  from  Figure  3,  as  the  slope  of  the  line  for  the  error 
in  the  original  solution  (solid  line)  is  approximately  two.  However,  the  linear  combination 
that  is  used  to  calculate  the  Richardson  extrapolate  is  such  that  that  part  of  the  error  that 
varies  as  is  canceled  out.  This  is  seen  from  the  fact  that  the  slope  of  the  line 
representing  the  error  in  the  extrapolated  solution  (dotted  line)  is  greater  than  two. 

In  order  to  determine  the  efficiency  of  using  this  technique,  we  investigated  the 
question  of  what  mesh  density  would  be  needed,  without  using  extrapolation,  to  obtain  the 
same  accuracy  as  was  obtmned  using  a  6  x  6  mesh  and  Richardson  extrapolation.  Note 
that  a  rough  idea  of  the  answer  can  be  predicted  from  the  top  graph  in  Figure  3.  It  was 
found  that  if  extrapolation  were  not  used,  it  would  be  necessary  to  use  a  30  x  30  mesh  to 
obtain  as  much  accuracy  as  was  obtained  using  Richardson  extrapolation  on  a  6  x  6  mesh. 
Of  course,  to  calculate  the  Richardson  extrapolate  on  a  6  x  6  mesh,  it  is  necessary  to  also 
calculate  the  refined  solution  on  a  1 1  x  1 1  mesh.  But  this  requires  only  the  use  of  a  121  x 
121  finite  element  matrix  while  the  analysis  of  the  problem  defined  on  the  30  x  30  mesh 
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without  extrapolation  requires  the  use  of  a  900  x  900  finite  element  matrix.  So,  clearly, 
the  use  of  Richardson  extrapolation  is  veiy  efficient  in  terms  of  the  use  of  computer 
memory.  But  it  is  also  beneficial  in  terms  of  the  computational  time  required.  The  total 
time  to  compute  the  Richardson  extrapolate  solution  (time  to  compute  solution  for 
original  spacing  +  time  to  compute  solution  for  halved  spacing  +  time  for  extrapolation) 
was  6.37  seconds,  which  is  much  less  than  the  17.306  seconds  it  took  to  compute  the 
solution  using  the  30  x  30  mesh  and  no  extrapolation. 

In  order  to  get  a  better  idea  of  the  kind  of  improvement  that  the  use  of  Richardson 
extrapolation  makes,  we  consider  the  same  problem  as  was  discussed  above  but  with  the 
boundary  condition  along  the  top  of  the  square  changed  to  u{x,2G)  =  sm{nx).  Figure  4 
presents  a  comparison  of  the  original  solution  obtained  using  a  9  x  9  mesh  (dotted  line) 
and  the  analytic  solution  (solid  line)  along  the  line  y=1.0  while  Figure  5  shows  a 
comparison  of  the  Richardson  extrapolate  (dotted  line)  and  the  analytic  solution  (solid 
line)  along  the  same  line.  Clearly,  the  Richardson  extrapolate  is  considerably  more 
accurate.  In  fact,  at  the  node  locations,  there  is  very  nearly  perfect  agreement  between  the 
Richardson  extrapolate  solution  and  the  analytic  solution. 

Conclusion 

In  this  paper,  the  use  of  Richardson  extrapolation  to  reduce  the  truncation  errors  in  the 
finite  element  analysis  of  two-dimensional  domains  is  discussed.  It  is  observed  that  use  of 
Richardson  extrapolation  yields  highly  accurate  results  with  quite  modest  requirements  in 
terms  of  computation  time  and  computer  memory. 
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Figure  3.  Nodal  Error  vs.  Spacing  Width 
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Abstract 

Chiral  coatings  have  been  advocated  in  the  past  for  management  and  reduction  of  radar 
cross-section  of  a  variety  of  scattering  objects.  The  theoretical  analysis  of  the  scattering  from 
chirally  coated  bodies  is  complicated  due  to  the  fact  that  the  left  and  right  circularly  polarised 
waves  propogate  in  the  chiral  layer  with  different  propagation  constants.  In  this  paper,  the 
T-matrix  method  has  been  used  to  analyse  the  scattering  from  chirally  coated  scatterers. 
The  scattering  characteristics,  viz,  absorption,  extinction,  radar  and  the  scattering  cross 
sections  have  been  computed  for  realistic  chiral  materials.  The  results  have  been  presented 
for  various  volume  concent erat ions  of  embedded  metal  helices,  angles  of  incidence,  frequency 
and  coating  thickness.  Prom  the  results  it  can  be  seen  the  normalised  Radar  Cross  Section 
of  chirally  coated  spheroid  is  8-10  dB  lower  than  the  conducting  spheroid  in  the  range  of 
8-12  GHz.  This  performance,  is  also  superior  in  comparison  with  the  conventinal  RAMs. 
This  reduction  in  the  RCS  has  been  shown  to  be  due  to  increased  absorption  and  reduced 
scattering  cross  section. 


1  Introduction 

The  objective  of  this  paper  is  to  evaluate  the  scattering  characteristics  of  axisymmetric  bodies 
coated  by  a  chiral  material,  when  a  plane  electromagnetic  wave  is  incident  on  it.  By  chirality  we 
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mean  lack  of  symmetry  or  in  other  words,  an  object  which  cannot  be  brought  into  congruence 
with  its  mirror  image  either  by  rotation  or  by  translation  is  said  to  be  chiral  object/material.  A 
lot  of  work  in  the  past  has  been  done  on  chiral/chirally  coated  spheres  and  spheroids  [1],  [2],  [3], 
[4],  etc.  Often  the  permeability  of  the  chiral  materials  has  been  ignored  by  assuming  it  to  be 
equal  to  that  of  free  space  as  seen  in  [1],  [5],  and  [6].  Such  a  treatment  is  not  correct  while 
dealing  with  chiral  materials.  Even  though  the  chiral  inclusions  may  be  non-magnetic  to  be^n, 
the  composite  chiral  material  will  have  magnetic  properties  [7].  The  properties  of  such  chiral 
materials  can  be  controlled  by  altering  the  size  and  concentration  of  the  embedded  chiral  particles 
to  yield  prespecified  chirality. 

The  principle  of  operation  of  realistic  chiral  materials  is  based  on  the  judicious  combination  of 
front  surface  matching  and  absorption.  The  role  of  chirality  is  in  providing  an  additional  degree  of 
freedom  for  this  match  which  cannot  be  achieved  by  using  conventional  Radar  Absorbing  Materials 
(RAMs).  Regardless  of  the  incident  polarisation,  back  scattered  incident  waves  travel  half  the 
time  with  propagation  constant  k^,  the  propagation  wave  vector  for  the  left  circularly  polarised 
wave,  and  half  the  time  with  kR,  the  propagation  wave  vector  for  the  right  circularly  polarised 
wave,  so  that  the  skin  depth  for  the  two  propagating  waves  plays  an  equal  role  in  absorption. 
Chirality  increases  the  average  absorption. 

Mie  solution  has  been  modified  to  compute  the  scattering  cross  sections  of  chirally  coated 
cylinders  and  spheres  of  small  size  [8].  Lakhtakia,  Varadan  and  Varadan  [1]  have  used  the  T- 
Matrix  method  to  analyse  single  layered  chiral  spheroids.  In  this  paper,  the  scattering  from 
multilayered  chiral  objects  has  been  cast  into  a  form  amenable  for  the  application  of  T-Matrix 
method.  The  constitutive  parameters  of  realistic  chiral  materials  made  by  Varadan,  Varadan  and 
Rho  [9]  are  used  to  gauge  the  extent  of  RCS  reduction  possible  with  chiral  coatings. 


2  Formulation  of  the  Problem 

The  Extended  Boundary  Condition  Method  (EBCM)  or  the  Transition  Matrix  (T  -  Matrix) 
technique  was  originally  developed  by  [10]  for  conducting  sphere.  In  this  method  the  fields  in  the 
various  regions  are  expanded  in  terms  of  vector  spherical  harmonics  and  the  principle  of  surface 
field  equivalence  has  been  used  to  evaluate  the  unknown  fields.  The  EBCM  relates  the  unknown 
scattered  field  coefficients  to  the  known  incident  field  coefficients  through  a  matrix  called  the 
T-matrix.  The  T-matrix  method  is  known  to  work  well  for  scatterers  of  arbitrary  shape  and  of 
sizes  beyond  the  resonance  region.  Further  the  polarisation  is  inherent  in  the  formulation.  For 
the  mutilayered  case,  the  formulation  is  done  by  expanding  the  fields  in  each  region  in  terras  of 
vector  spherical  harmonics  with  unknown  (known  coefficients  for  the  incident  fields)  coefficients. 
The  problem  is  divided  into  three  subprobleras  as  given  in  [11],  using  the  surface  field  equivalence 
principle.  Each  subproblem  is  such  that  the  fields  in  the  region  of  interest  are  radiating  in  space, 
with  the  constitutive  parameters  to  be  the  same  as  that  of  the  region  of  interest.  The  sub¬ 
problems  can  be  reduced  to  a  set  of  simultaneous  equations  and  then  can  be  cast  into  a  matrix 
equation.  The  unknown  scattered  field  coefficients  can  be  computed  from  the  known  incident  field 
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coefficients,  from  the  T-Matrix. 


3  Results  and  Discussion 

Ro,  Varadan  and  Varadan  [9]  have  made  left  handed,  right  handed  and  the  racemic  samples  of 
chiral  composites  with  0.8%,  1.6%  and  3.2%  volume  concenterations  [9].  The  matrix  material 
used  by  [9]  is  eccogel  1365-90  from  Emerson  and  Cuming.  The  left  handed  (right  handed)  sample 
consists  of  only  left  handed  (right  handed)  springs,  while  for  the  racemic  samples  the  same  amount 
of  left  and  right  handed  spring  have  been  mixed.  The  sample  consists  of  four  layers.  One  layer  has 
pure  eccogel  and  the  remaining  three  layers  have  chiral  inclusions  inside.  The  spherical  inclusion 
is  made  up  of  metal  because  of  easy  availability  and  relatively  low  cost.  It  could  also  be  made 
of  polymers. The  data  presented  in  [9]  are  the  measurements  made  at  discrete  frequencies  and  are 
used  in  this  work. 

The  real  and  the  imaginary  parts  of  the  permittivity,  permeability  and  the  chirality  parameter 
are  plotted  as  a  function  of  frequency  in  Figs.  1  for  a  1.6  %  and  3.2  %  concentration  of  right 
handed  helices.  The  variation  for  the  left  handed  helices  are  similar  and  hence  not  shown.  From 
Fig.  1  it  can  be  seen  that  the  permeability  of  realistic  chiral  material  may  not  be  set  to  unity. 
This  has  also  been  emphasised  in  [7]. 

Chirally  coated  oblate  spheroids  are  considered  for  illustrative  purposes.  The  axial  ratio  has 
been  chosen  to  be  0.8.  The  equivalent  diameter  of  the  conducting  sphere  has  been  chosen  to 
be  6  cm.  The  coating  thickness  has  been  chosen  to  be  5  mm.  The  results  for  the  scattering 
characteristics  are  presented  for  right  handed  helices  only.  The  results  in  Fig.  2  are  for  two 
different  concentrations  of  the  right  handed  helices,  1.6  %  and  3.2  %  for  vertical  polarisation. 
The  results  for  the  other  polarisation  and  for  left  handed  and  racemic  type  of  helices  are  the 
same  and  hence  not  plotted  here.  The  results  are  actually  for  discrete  set  of  frequencies  and 
thus  the  graph  is  as  shown.  The  results  for  the  absorption  cross  section,  extinction  cross  section, 
radar  cross  section  and  the  scattering  cross  section  are  shown.  The  normalised  RCS  reduces  by 
more  than  12  dB  for  a  3.2%  cone,  and  below  6  dB  for  a  1.6%  cone,  of  right  handed  helices. 
From  Fig.  2  (b)  the  normalised  scattering  cross  section  is  seen  to  reduce  with  the  increase  in  the 
volume  concentration  of  the  helices  which  is  as  expected.  The  normalised  absorption  cross  section 
increases  as  the  volume  concentration  of  the  right  handed  helices  increases,  leading  to  a  higher 
reduction  in  RCS.  As  seen  from  Fig.  2  (d)  the  normalised  extinction  cross  section  is  almost  the 
same  for  both  the  volume  concentrations. 

For  the  purpose  of  comparison  a  practical  RAM  was  chosen  from  [12]  of  Cr  =  1.3  -f-  zO.29. 
An  oblate  conducting  spheroid,  was  coated  by  such  a  RAM  and  the  coating  thickness  was  kept 
fixed  as  5  mm  as  that  of  chiral  coating.  The  inner  aspect  ratio  was  chosen  to  be  0.8.  The  outer 
equivolume  diameter  was  chosen  to  be  6  cm.  The  RCS  of  the  conducting  oblate  spheroid  coated 
by  a  conventional  RAM  is  compared  with  the  chirally  coated  spheroid.  The  response  is  shown  for 
the  frequency  range  of  8-12  GHz.  The  RCS  comparison  between  the  two  coatings  has  been  shown 
for  both  the  horizontal  and  the  vertical  polarisations  in  Figs.  3.  It  can  be  seen  from  the  plots  that 
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Figure  1:  Variation  of  the  complex  permittivity,  permeability  and  the  chirality  parameter  as  a 
function  of  frequency 


for  the  same  coating  thickness,  the  RCS  for  a  RAM  coating  gets  reduced  by  5  dB,  as  compared 
to  a  reduction  of  more  than  15-20  dB  when  coated  by  a  realistic  chiral  material  of  concentration 
3.2%.  The  reduction  in  the  RCS  is  only  5-6  dB  for  a  volume  concentration  of  1.6%.  This  was 
seen  for  both  the  horizontal  and  vertical  polarisations. 

The  variation  in  the  scattering  characteristics  as  a  function  of  the  angle  of  incidence  is  shown 
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Figure  2:  Variation  of  normalised  (a)radar  cross  section,  (b)scattering  (c)absorption  and 
(d)extinction  cross  section  for  an  cMrally  coated  spheroid  of  aspect  ratio  0.8,  equivalent  outer 
diameter  =  6  cm,  as  a  function  of  frequency 

in  Fig.  4  .  The  frequency  has  been  fixed  at  9.87  GHz.  The  values  for  the  complex  permittivity, 
complex  permeability  and  the  complex  chirality  as  taken  from  [9],  are  Cr  =  3.529  +  i3.731,  fir  = 
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Figure  3:  Variation  of  normalised  radar  cross  section  for  a  RAM  and  a  chiral  coated  oblate  spheroid 
for  an  aspect  ratio  of  0.8,  (a)horizontal  (b)vertical  polarisation  as  a  function  of  frequency 


Figure  4;  Variation  of  (a)normalised  radar  cross  section  (b)absorption  cross  section  for  a  RAM 
and  a  chiral  coated  oblate  spheroid  for  an  aspect  ratio  of  0.8,  as  a  function  of  angle  of  incidence 

0.820  -  iO.353  and  /3  =  (18.091  -  il2.01)  x  10"^  cm.  These  are  for  RCP  helices  embedded  in  the 
host  medium  with  a  3.2%  volume  concenteration.  The  results  have  been  plotted  for  only  vertical 
polarisation  even  though  the  results  are  the  same  for  horizontal  polarisation..  These  results  are 


453 


Figure  5:  Variation  of  (a)  normalised  radar  cross  section  (b)  absorption  cross  section  for  a  RAM 
and  a  chiral  coated  oblate  spheroid  for  an  aspect  ratio  of  0.8,  as  a  function  of  normalised  coating 
thickness 

compared  with  an  conventional  RAM  coating  of  =  1-3  +  iO.29  at  9.87  GHz.  The  coating 
thickness  is  kept  the  same  as  that  for  the  chiral  coating.  The  aspect  ratio  of  0.8  has  been  chosen 
for  the  RAM.  As  seen  from  the  graphs,  the  normalised  RCS  for  RAM  coated  oblate  spheroid  for 
this  particular  frequency  is  less  than  that  for  the  chirally  coated  spheroid.  The  absorption  cross 
section  for  the  RAM  is  almost  half  of  that  seen  for  a  chirally  coated  conducting  spheroid. 

Fig.  5  shows  the  variation  of  the  scattering  characteristics  of  RAM  and  chirally  coated  oblate 
spheroid  as  a  function  of  normalised  coating  thickness  for  vertical  polarisation.  The  outer  oblate¬ 
ness  has  been  chosen  to  0.8  and  the  equivalent  diameter  has  been  fixed  to  be  7  cm.  The  frequency 
has  been  chosen  to  be  9.87  GHz  and  the  constitutive  parameters  are  same  as  that  chosen  earlier 
in  the  case  of  angle  of  incidence  variation.  The  results  are  shovm  once  again  for  a  right  handed 
helices  embedded  in  a  epoxy  host  medium.  As  seen  from  the  figure  the  reduction  in  the  RCS  for 
a  chirally  coated  spheroid  is  more  than  for  a  RAM  coating  for  small  coating  thicknesses.  The 
reduction  in  RCS  for  a  normalised  coating  thickness  of  0.12  is  about  21  dB  for  chiral  coating  and 
about  3  dB  for  a  RAM  coating.  The  sharp  null  in  the  RCS  that  is  seen  around  the  normalised 
coating  thickness  of  0.62  (coating  thickness  of  2.2  cm)  is  due  to  the  resonance  occuring  when  the 
coating  thickness  is  nearly  a  multiple  of  A  and  is  of  low  bandwidth.  However  the  RCS  reduction 
seen  in  chiral  coating  is  devoid  of  such  oscillations,  and  would  offer  wider  bandwidth.  The  ab¬ 
sorption  cross  section  is  about  1.2  for  the  chiral  coating  and  0.5  for  RAM  coating  for  the  same 
coating  thickness.  As  the  coating  thickness  increases  the  conducting  spheroid  reduces  to  a  fully 
chiral/dielectric  spheroid  of  oblateness  (outer)  0.8. 
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Abstract  -  This  paper  presents  a  general  method,  based  on  the  electrokinetic 
problem,  to  compute  the  source  current  density  in  inductors  of  any  shape.  The  so- 
computed  current  density  has  the  same  continuity  property  as  die  physical  one 
because  the  zero  divergence  condition  is  strongly  satisfied.  Moreover,  using  a 
tensorial  form  of  the  electrical  conductivity,  coU  inductors  can  be  modelized  in 
addition  to  massive  inductors.  The  computed  current  density  can  be  directly 
introduced  in  a  magnetic  vector  potential  formulation  or  in  some  kind  of  mixed  h-a 
formulation  or  used  to  compute  a  source  magnetic  field  which  is  needed  in  a  h 
formulation  using  a  magnetic  scalar  potential. 

I.  Introduction 

Solving  magnetostatic  or  eddy  current  problems  using  a  magnetic  vector  potential  formulation  or 
some  kind  of  mixed  formulation  requires  the  computation  of  a  source  current  density  associated  with 
the  inductors.  Moreover,  a  formulation  using  a  magnetic  scalar  potential,  i.e.  a  h  formulation, 
necessitates  the  computation  of  a  source  magnetic  field  hg  associated  with  inductors.  Several  methods 
have  been  proposed  by  many  authors  [1,2, 3,4]  to  compute  such  source  fields,  but  are  not  general  in  the 
sense  that  they  are  either  dependent  on  the  shape  of  the  inductor  [1,2]  or  usually  need  a  high  CPU  time 
[3,4].  In  the  following  is  presented  a  method  entirely  general  to  compute  the  source  current  density  in 
inductors  of  any  shape  and,  eventually  thereafter,  a  source  magnetic  field.  This  method  enables  to  take 
into  consideration  different  kinds  of  inductors,  i.e.  massive  or  coU  inductors. 

II.  A  mixed  formulation  based  on  the  electrokinetic  problem 


The  electrokinetic  problem  consists  in  finding  two  vector  fields,  the  electric  field  e  and  the  current 
density],  in  an  inductor  Q  (Fig.  1),  These  fields  have  to  satisfy  the  equations  obtained  from  Maxwell’s 
equations  where  time  dependent  terms  are  omitted,  i.e. 

div  j  =  0  in  Q  ,  (1) 

curl  e  =  0  in  Q  ,  (2) 

associated  with  the  boundary  conditions,  on  the  complementary  parts  Fj  and  Fg  of  9f2, 
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(3) 

(4) 


n  .  j  =  0  , 

nxe=0  onTg, 

where  n  is  the  external  unitary  normal  of  the  corresponding  surface.  To  these  equations  is  added  the 
electrical  constitutive  law,  i.e.  the  local  Ohm  law 

j  =  a  e  ,  (5) 

where  cr  is  the  electrical  conductivity  of  the  inductor  Q.  As  it  will  be  explained  further,  the  conductivity 
can  be  either  scalar  or  tensorial  to  be  able  to  take  into  consideration  different  kinds  of  inductors. 

Since  the  electric  field  e  is  a  zero  curl  field  (2),  it  can  derive  from  a  scalar  potential  v,  i.e. 

e  =  -  grad  v  . 


Fig.l.  Inductor  Q  and  its  complementary  boundaries  Tj  and  Te  (Fg  =  r®  u  Fg). 

The  Withney  sequence  [6]  and  its  generalized  form  [7]  defining  some  mixed  finite  elements,  appear  to 
be  well  suited  to  the  approximation  of  fields  in  mixed  formulations  [5,8].  Such  finite  elements  can  be 
defined  on  tetrahedra,  hexahedra  and  prisms.  The  so-defined  finite  element  spaces  are  associated  with 
nodes,  edges,  facets  and  volumes  of  the  discretized  domain,  i.e.  the  mesh,  and  are  respectively  denoted 
S^  S^,  S2  and  S^  [7].  The  sets  of  nodes,  edges,  facets  and  volumes  of  the  mesh  are  denoted  by  N,  E,  F 
and  V,’  respectively.  Their  sizes  are  #N,  #E,  #F  and  #V.  A  subscript  associated  with  a  function  space  (-j 
or  -g)  denotes  the  associated  boundary  condition  (on  Tj  or  Fg ). 

The  following  notations  are  used  for  integrations  of  products  of  scalar  or  vector  fields  over  a  volume 
a,  where  L2  and  L2  are  the  spaces  of  square-summable  scalar  and  vector  functions : 

(u,  v)  =  J  u(x)  v(x)  dx ,  u.  V  €  L^(f2)  ,  (u,  v)  =  J  u(x) .  v(x)  dx ,  u,  v  €  (f2)  . 

a  ^ 

From  the  equations  of  the  electrokinetic  problem  (1-6),  a  mixed  formulation  can  be  written  where  two 
fields,  j  and  v,  are  approximated  simultaneously.  The  problem  is  to  find  j  in  S2  and  v  in  S^,  S  and  S 
being  the  function  spaces  of  facet  and  volume  finite  elements  defined  on  the  discretized  domain  [7],  such 
that 

(  C7-1  j  ,  j’ )  -  (  V  ,  div  j')  =  0,  Vj’€H(divj),  (7) 

(divj,v')  =  0.  Vv'€L2(Q),  (8) 
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H(diVj)  and  L2(a)  being  approximated  by  and  S^;  j’  and  v'  are  test  functions  of  these  spaces. 
Equation  (7)  is  actually  a  weak  form  of  the  definition  of  an  electric  scalar  potential  v,  given  by  (6),  and  is 
obtained  by  its  integration  by  parts  while  equation  (8)  expresses  a  strong  form  of  the  charge  conservative 
equation  (1).  These  equations  can  also  be  obtained  using  Lagrangian  multipliers  [12],  these  being 
interpreted  as  associated  with  a  weak  electric  potential  v. 

Facet  and  volume  finite  elements  are  used  to  approximate  respectively  the  unknown  current  density  j 
and  the  scalar  potential  v,  i.e. 

j  =  Z  jf  sf  V  =  2^  v^  Sk  ,  (9-10) 

jeF  keV 

where  sj^  and  s^  are  respectively  the  sh^e  functions  associated  with  the  j-th  facet  in  set  F  and  the  k-th 
volume  in  set  V  of  the  mesh  and  where  jj  and  v^  are  the  associated  degrees  of  freedom,  respectively 
the  flux  of  the  current  density  across  the  j-th  facet  and  the  volume  integral  of  the  potential  v  in  the  k-th 
volume.  Then,  the  discretization  of  equations  (7)  and  (8)  leads  to  the  following  finite  element  equations: 

)jf-  X(s^,divsf  )v^=0  ,i=l,  ...,  #F,  (11) 

jeF  keV 

X(divsf,s>^)j[=0  ,i=l,  ...,  #V.  (12) 

jeF 

Therefore,  by  imposing  the  current  density  on  a  cross  section  of  an  inductor  of  any  shape  and  by 
setting  to  zero  its  normal  component  on  the  lateral  faces  (which  define  essential  boundary  conditions), 
the  source  current  density  and  the  potential  v  can  be  calculated  everywhere  in  the  inductor  by  using  the 
proposed  mixed  formulation.  This  source  field  has  the  same  continuity  property  as  the  physical  one,  i.e. 
the  continuity  of  its  normal  component,  because  the  zero  divergence  condition  is  strongly  satisfied  by 
(8).  This  method  presents  the  advantage  of  being  entirely  general  for  inductors  of  any  shape  and  requires 
no  more  CPU  time  than  usual  finite  element  computations. 

III.  Massive  and  coil  inductors 

The  electrical  conductivity  o  introduced  in  the  local  Ohm  law  (5)  can  take  different  mathematical 
forms:  it  can  be  either  scalar  or  tensorial. 

First,  if  the  conductivity  is  chosen  as  being  scalar,  equal  to  the  conductivity  of  the  material,  the 
inductor  is  therefore  considered  as  an  isotropic  material  with  identical  electrical  properties  in  any 
direction.  In  consequence,  only  massive  inductors  can  be  modelized  with  this  kind  of  conductivity.  The 
obtained  current  density  will  be  higher  in  the  vicinity  of  the  interior  bends  than  in  the  vicinity  of  the 
exterior  bends. 

This  kind  of  current  distribution,  characteristic  of  massive  inductors,  cannot  be  accepted  if  one  wants 
to  modelize  coil  inductors,  in  which  the  current  is  not  affected  by  the  bends  and  remains  uniformly 
distributed  on  any  cross  sections.  An  accurate  determination  of  the  current  distribution  in  coil  inductors 
is  necessary  for  an  accurate  evaluation  of  force  or  torque  quantities  [10].  This  can  be  done  by 
introducing  a  tensorial  form  of  the  conductivity  instead  of  a  scalar  one.  In  this  way,  coil  inductors  can  be 
taken  into  account  thanks  to  an  anisotropic  tensor  a  having  a  principal  conductivity  along  the  wires  of  the 
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inductor  equal  to  the  conductivity  of  the  material,  and  two  weak  principal  conductivities  in  the 
perpendicular  directions  to  the  wires,  resulting  from  the  residual  insulator  conductivity. 

In  each  finite  element  of  the  discretized  conductor,  one  principal  direction  of  ^isotropy  is  ^osen 
along  the  fiber  of  the  inductor,  the  two  others  being  perpendicular  to  this  direction  (Fig.  2).  Fig.  3 
illustrates  the  alignment,  in  each  finite  element,  of  the  principal  axis  along  the  fiber  of  a  circular  inductor. 


Fig.  2.  The  three  principal  axis  in  a  finite  element.  Fig.  3.  Alignment  of  one  principal  axis  along  the 

fibers  of  an  inductor. 


IV.  Magnetic  vector  potential  formulation 


The  equations  of  the  magnetostatic  problem  are 

curl  h  =  is  ,  div  b  =  0  ,  b  =  p  h  ,  (13-14-15) 

where  h  is  the  magnetic  field,  b  is  the  magnetic  flux  density,  jg  is  the  source  current  density  associated 
with  the  inductors  and  \i  is  the  magnetic  permeability.  A  magnetic  vector  potential  a  can  be  defined  by 

b  =  curl  a  , 

so  that  (14)  is  impUcitiy  satisfied.  Then,  by  expressing  (13)  in  a  weak  form  and  introducing  (15)  and 
(16),  one  gets 

( |i-l  curl  a  ,  curl  a’ )  =  (  is  .  a’),  V  a' e  H(curle)  ,  (17) 

H(curle)  being  approximated  by  Sh  a'  is  a  test  function  of  this  space.  If  edge  elements  are  used  for  the 
dicretization  of  a,  this  equation  takes  the  following  form 


where  sf  is  the  shape  function  associated  with  the  j-th  edge  of  the  mesh  and  aj  is  the  circulation  of  *e 
vector  potential  along  this  edge  [7,1 1].  A  gauge  condition  on  a  can  be  defined  in  the  space  of  edge  finite 
elements  thanks  to  a  tree  built  in  the  mesh  [1 1];  #Ect  is  the  number  of  edges  of  the  associated  co-tree  and 
the  equations  (16)  are  only  associated  with  these  edges. 
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One  way  to  deal  with  this  problem  is  first  to  solve  (1 1)  and  (12),  and  afterwards  to  solve  (18),  which 
is  more  efficient  for  transient  or  nonlinear  problems.  Another  possibility  is  to  solve  (11),  (12)  and  (18) 
simultaneously,  which  leads  to  a  matrix  associated  with  the  system  given  by 


"Si  C  0" 
S2  D 

.0  0, 


(19) 


The  number  of  edges,  facets  and  volumes  being  respectively  denoted  #E,  #F  and  #V,  matrix  Si,  83,  C 
and  D  are  of  size  (#Ex#E),  (#Fx#F),  (#Ex#F)  and  (#Fx#V)  respectively,  the  number  of  equations 
being  equal  to  #E+#F+#V,  The  global  matrices  are  indefinite  so  that  classical  iterative  techniques  such 
as  conjugate  gradients  are  not  suitable.  So,  the  large  number  of  equations  and  the  very  small  bandwidth 
have  led  to  use  the  following  sequence  of  algorithm  to  solve  the  entire  system:  RCMK  reordering  of  the 
matrix,  ILUTP  preconditioner  and  GMRES  solver  [9].  This  sequence  takes  no  more  computational  time 
and  is  even  more  efficient  than  ordinary  gradient  methods. 

Other  kinds  of  formulations,  such  as  mixed  h-a  formulations  for  instance  [13],  which  require  the 
knowledge  of  the  source  current  density,  can  be  coupled  with  the  proposed  mixed  formulation  for  the 
computation  of  that  source  field.  Furthermore,  the  source  current  density  can  also  be  used  to  calculate 
a  source  magnetic  field  hg  which  is  needed  by  formulations  using  a  magnetic  scalar  potential,  i.e.  h 
formulations.  Indeed,  the  curl  of  a  source  field  hg  has  to  be  equal  to  jg,  which  can  be  satisfied  exactly 
using  edge  finite  elements  to  interpolate  hg  [3]. 


V.  Applications 


In  order  to  illustrate  the  effectiveness  of  the  method  mentioned  above,  the  source  current  density  in 
practical  inductors  surrounded  by  air  has  been  computed.  The  same  mesh  is  used  for  both  massive  and 
coil  inductors  (Fig.  4). 


Fig.  4.  Mesh  of  a  part  (l/4th)  of  the  studied  inductors. 

Fig.  5  and  6  show  the  current  density  distribution  on  two  cut  planes  in  a  copper  massive  inductor 
whose  conductivity  is  equal  to  5.9  10^  S/m.  As  mentioned  above,  those  figures  show  that  the  current 
density  is  higher  in  the  vicinity  of  the  interior  bend  of  the  inductor. 
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Fig.  5.  Current  density  distribution  in  the  massive  inductor. 


exterior  bend 


Fig.  6.  Current  density  distribution  in  the  massive  inductor  (cut  plane  in  the  bend). 

On  the  other  hand,  a  copper  coil  inductor  is  also  considered,  with  a  principal  conductivity  along  the 
direction  of  the  wires  equal  to  the  copper  conductivity  and  two  principal  conductivities  along  the 
perpendicular  directions  equal  to  lO"®  S/m.  Fig.  7  shows  the  computed  current  density  distribution.  In 
that  case,  the  current  density  distribution  is  not  affected  by  the  bend  and  is  the  same  as  in  the  initial  cross 
section  of  the  inductor;  it  remains  uniform  all  along  the  inductor. 


Fig.  7.  Current  density  distribution  in  the  coil  inductor. 
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The  computed  source  current  density  can  then  be  used  in  the  a-formulation.  As  an  example,  Fig.  8 
shows  the  magnetic  flux  density  distribution  in  the  air  around  a  coil  inductor  of  another  shape.  This 
figure  also  shows  the  computed  source  current  density  in  the  inductor. 


Fig.  8.  Source  current  density  (on  plane  z  =  0)  in  a  coil  inductor  and  associated 
magnetic  flux  density  (on  planes  x  =  0  and  y  =  0)  computed  with  a  a-formulation. 


VI.  Conclusions 

A  mixed  formulation  for  the  computation  of  the  source  current  density  appears  to  be  a  simple  and 
accurate  method  and  is  always  applicable  to  inductors  of  any  shape.  Moreover,  the  method  of  defining 
an  anisotropic  conductivity  to  take  into  account  coil  inductors  appears  to  be  reliable  and  can  be  applied  in 
any  coil  inductor  to  compute  the  associated  source  fields.  Finally,  the  method  can  be  coupled  with  any 
formulation  needing  the  computation  of  a  source  field,  i.e.  a  source  current  density  or  a  source  magnetic 
field.  In  each  case,  the  interpolation  of  the  source  current  density  in  the  space  of  facet  finite  elements  is 
well  adapted  to  each  formulation,  which  requires  the  continuity  of  this  density. 
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fflGH  POWER  MICROWAVE  AMPUFICAIION  BY  fflGH-INTENSITY 


RELATTVISnC  ELECTRON-BEAM  STORAGE-RINGS 
Ross  A.  Spedale 
Redondo  Beach,  California 

A  very  interesting  and  diallen^ng  set  of  rather  fundamental  computational  electromagn^cs 
problems  is  posed  by  the  development  of  a  recently  conceived  new  type  of  high  power  microwave 
amplifier,  that  is  capable  of  generating  gigawatt-level  output  peak-pulse  powers,  anywho^  in  the  1  to  40 
G^  frequaicy  range,  with  high  power  gain,  ^cien<7,  duty  cycles,  phase  coherence,  and  spectral  purity. 

The  new  type  of  high  power  microwave  amplifia*  combines  the  basic  concepts  of  the  relativistic 
klystron  amplifier  (RKA)  with  the  well-known  technology  of  electron-beam  storage-rings  (EBSR),  by  using 
a  high-intensity,  relativistic  electron-beam  (HIREB),  circulating  along  re-entrant,  multi-turn,  toroidal- 
helical  orbits.  The  nominal,  toroidal-helical  equilibrium  orbits  wrap  many  times  around  a  virtual  toroidal 
surface,  with  large  radius  R  ,  and  small  radius  r ,  exactly  closing  after  a  int^ral  number  of  turns. 

A  nine-turn  set  of  re-entrant,  toroidal-helical  orbits  Is  shown  in  figures  1-3  ,  as  an  example. 

Unlike  the  Gyrotron  and  the  Relativistic  Magnetron,  the  new  electron-beam  device  operates  as  a  low 
phase-noise,  high  power-gain  amplifier,  rather  than  a  self-osdliator.  A  sharply  bunched,  high-intensity 
relativistic  electron  beam,  typically  carrying  a  10*  Amp  avo^e  current  at  a  1  Mev  electron  eno^, 
interacts  along  the  re-entrant,  multi-turn,  toroidalkielical  orbits,  with  two  azimuthally-periodic, 
electrically-large,  and  mutually-interleaved  microwave  wave-guiding  structures,  eadi  comprising  widely- 
separated,  multiple  clusters  of  mutually-coupled  beam-bunching,  and  power-extraction  cavity  resonators. 

The  two  microwave  wave-guiding  structures  parform  the  functions  of  input,  and  respectively  output 
microwave  circuits  of  the  ampliner,  and  can  operate  on  harmonically-rdated  input,  and  output 
firequendes,  if  the  input  beam-bunching  cavities  are  coupled  to  a  single  turn  of  the  re-entrant,  multi-turn, 
toroidal-heUcal  orbit,  while  the  output  power-extraction  cavities  are  coupled  to  all  the  turns.  In  this  case 
the  high  power  microwave  amplifier  also  performs  an  additional  frequenty-multiplication  function. 

High  effidency,  and  high  duty  cydes  are  attained  by  re-drculating  the  high-intensity  relativistic 
dectron  beam,  rather  than  wasting  most  of  its  stored  power  in  a  beam-dump,  as  in  the  still  largely 
experimental  relativistic  klystron  amplifiers  described  to  date.  Further,  no  high-power  bank  of  large 
storage  capadtors  is  required,  as  very  large  total  drculating  beam  current  can  be  attained  in  the  re¬ 
entrant,  multi-tum,  toroidal-helical  electron  orbit,  with  each  turn  carrying  only  a  fraction  of  the  total 
drculating  current,  and  by  gradually  building  up  the  drculating  beam  current  by  synchronous,  multi-tum 
beam-injection  (beam-stacking). 

A  very  large  number  of  high-space-charge-density  dectron-bunches  drculates,  within  azimuthally- 
poiodic,  aUemating-gradient  (AG)  guiding  magnetic  fields,  along  the  re-entrant,  multi-tum,  tm^idal- 
helical  orbits  of  the  new  type  of  high  power  microwave  amplifier,  at  an  essentially  constant  orbital 
frequency.  A  total  of  120  electron  bunches  are  shown  as  small  drdes  in  the  figure  1-3  examples,  where 
the  electron  bunches  are  shown  admuthally  spaced  by  IT  along  each  of  the  nine  orbit  turns. 

The  bunching  cavity  resonators  of  the  input  microwave  stmeture,  and  the  power-extracting  cavity 
resonator  of  the  output  stmeture  are  interleaved  to  the  azimuthally-periodic  set  of  storage-ring  magnets, 
that  generate  the  ’strong  focusing,  aUemating-gradient’  ,  guiding  magnetic  field.  The  use  of  strong- 
focusing,  alternating-gradient  guiding  fidds  (apparently  never  before  applied  to  the  high-intensity, 
relativistic  RKA  electron  beams)  vastly  reduces  the  required  beam-aperture. 

Ftolher,  a  large  number  of  electrostatic  beam-deflection  channels,  periodically  distributed  around 


464 


the  dectron-beam  storage-ring,  introduce  a  radially-oriented  DC  dectric  d^ection-fidd,  poinfang  outward 
of  the  rirtual  toroidal  surface  of  the  equilibrium  orbits,  along  the  direction  of  the  torus  small  radius  r . 
The  radially-oriented  dectric  fidd  provides  the  mput  DC  power  to  the  ampUfier,  by  fordng  the  path  of 
partially  energy-depleted  dectron-bunches  that  exit  the  power-extraction  cavity  ^nators,  on  a  constant, 
nominal  small  radius  r.  The  described  electrostatic  beam-deflection  diwnels  deliva-  all  the  required  input 
DC  current,  and  power  required  for  operation,  performing  a  function  similar  to  that  of  the  radially- 
oriented  dectrostaUc  deflection  fidd  is  similar  to  that  of  the  ^crossed*  radial  dectric  fidd  between  the 

cathode  and  the  anode  of  a  conventional  magnetron.  ^  .  u  • 

The  new  high-power  microwave  amplifier  concepts  [HOvides  a  numba*  of  substantial,  rather  obvious 
technical  advantages.  First,  high  efficiency  is  attained  by  re-drculating  the  high-intensity  dedron-beam, 
thus  totally  recovering  its  residual  kinetic  energy.  This  efiminates  the  need  for  a  be^-dump,  ^d  for 
a  multi-joule  pulse^encrator.  The  continuous  supply  of  DC  power  by  the  dectrostatic  beam-ddlection 
rfiannffk  makes  higher  duty-cydes  possible,  as  no  large  bank  of  capadtors  needs  re-charging  bdween 
output  pulses.  The  large  number  of  orbit  turns  reduces  the  average  current  each  carries,  and  the  v^ 
large  total  number  of  dectron  bunches  reduces  the  required  local  spaced^ge  density.  The  large  number 
of  power-extracting  cavity  resonators  reduces  the  power  each  of  the  cavities  is  r^uired  to  extract  from 
the  beam,  a  most  remarkable  advantage  with  the  smaller,  higher  frequendes  cavities.  The  use  of  strong- 
focusing,  alternating-gradient  guiding  fidds  largdy  reduces  the  required  beam  aperture  in  both  the  input 
bunching  cavities,  and  the  output  power-extracting  cavities.  Finally,  the  capability  of  combining 
frequency-multiplication  with  power  amplification,  eliminates  feedback-induced  instabiUty, 

These  technical  advantages  may  be  attained  by  solving  a  number  of  new,  challenging  problems. 
First,  the  wdl-established  theory  of  betatron,  and  synchrotron  osdllations  in  strong-focusing, 
alternating-gradient  guiding  fidds  [1,2]  must  be  reformulated  in  the  context  of  high  space-charge 
densities,  that  induce  high-intensity  sdf-generated  fields  around  the  dectron  bunches.  These  fidds  may 
provide  beam  self-focusing,  under  appropriate  conditions.  Fiirther,  the  dectro-dynamic  be^loading  on 
the  input  and  output  cavity  resonators  must  be  evaluated,  as  a  basis  for  the  design  of  the  input,  power- 
spUtting  microwave  structure,  and  of  the  high-power,  power-combining  output  structure.  The  impedanro 
matdiing  to  both  sets  of  cavity  resonators  is  obviously  rather  critical  at  the  indicated  power  levels,  as  is 
the  amplitude  and  phase  stability  of  the  resonator  internal  fields. 

The  input,  and  output  microwave  wave-guiding  structures  must  therefore  meet  stringent  phase- 
coherence,  and  phase-stabifity  requirements,  in  performing  the  input  power-splitting,  and  respectively  the 
output  power-combining  functions,  under  conditions  of  high  peak-pulse  power,  and  the  high^ergy 
dectrodynamic  loading,  and  cavity  cross-coupling,  due  to  the  circulating  dedron-beam  bunches.  These 
requirements  are  somewhat  similar  to  those  that  must  be  md  by  the  be^forming  networks  of  large, 
high-directivity  phased  arrays,  where  the  dectroma^dic  proximity  coupling  betweoi  radiating  dements 
can  not  be  negleded,  without  incurring  in  the  notorious  array-blindness  effed. 
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1.  INTRODUCTION 

Ionosphere  sounding  consists  of  transmitting  of  short  HF  radio  pulses  on  a  set  of 
jfrequendes  and  registration  of  characteristics  of  reflected  of  ionosphere  signals.  Registered 
parameters  of  signals  may  he;  time  delay,  that  is  called  virtual  hei^t  of  reflection  region, 
anqilitude,  phase,  polarization  of  the  received  signal.  Now  for  acquisition  of  these  data  are 
used  digital  ionosonde.  The  block  diagram  of  our  ionosonde  "Pams”  is  presented  on  fig.  1. 
The  sounding  pulses  are  formed  by  digital  Receiver-synthesizer.  High  fi-equency 
transmitting  signal  Fe  enters  to  wideband  Power  anqilifier  and  further  through  Antenna 
switch  is  excited  by  transmitting  antenna.  The  signal  Fr,  reflected  of  ionosphere,  enters  firom 
receiver  to  A/D  converter.  Signal  processor  performs  aU  necessary  actions  with  digiteed 
data  Ud  according  to  regime  given  by  personal  conqiuter  (PC).  The  total  control  of  all 
processes  of  different  modules  of  ionosonde  is  managed  by  PC  and  modulus  of 

Synchronization  and  control.  . 

In  our  rqjort  we  describe  the  results  of  using  DSP  TMS320C25  in  order  to  provide  the 

process  of  sounding  data  treatment. 

The  feature  of  DSP  work  is  its  periodic  repeatability,  synchronized  the  process  of  radio 
pulses  excitation.  The  speed  of  processing  of  arriving  data  should  be  sufficient  for 
maintenance  in  rate  of  sounding  pulses.  Though  the  volume  of  data,  received  m  each  cycle 
of  sounding  is  conqiarably  smaU  (2  Kwords  in  our  case),  it  is  required  to  perform  the 

sufficiently  large  calculations  to  the  utmost  fast. 

After  getting  of  2*1024  samples  from  ADC  on  a  set  of  distances,  DSP  starts  the 
processing  of  received  digital  data:  calculations  values  of  quadrature  components,  phases 
and  anqilitudes,  and,  in  dependence  on  given  regime,  proceeds  the  detection  of  signals, 
estimation  of  current  receiver  attenuation  or  characteristics  of  accepted  signal. 

The  main  operations,  executed  by  DSP  processor  are: 

-reception  of  realizations  of  signals  and  their  buffering; 

-quadrature  processing  of  realizations; 

-anqihtude/phase  detecting  of  signals  fi:om  quadratures; 

-separation  of  0/X  conqionents; 

-calculation  of  threshold  of  signal  separately  in  both  charnels; 

-estimation  of  airqilification  factor  using  calculated  amplitudes; 

-average  filter  (  16  points  ); 

-"triangular"  filter  (  3  points  ); 

-accumulation  of  realizations; 

-search  of  useful  signal; 

-search  of  anqilitude  maidma; 

-correlation  processing  of  signals  in  case  of  use  of  Pseudo-noise  coding. 
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Fig.  1 


All  operations  are  proceeded  simultaneously  in  two  channels  (we  use  two-channel  radio 
receiver  to  get  the  opportunity  of  separation  Ordinary/Extraordinary  components  of 
received  signals.  The  DSP  software  allows  to  treat  data  fi'om  two  types  receiver;  the  first 
with  final  intermediate  Jftequency  (IF)  128  KHz,  the  second  with  zero  IF  and  quadrature 
output.  Almost  aU  program  components  in  both  cases  are  common,  despite  the  fact,  that  in 
the  system  with  IF  128  KHz  first  operation  is  the  calculation  of  quadratures. 

2.  DESCRIPTION  OF  MOST  COMPLICATED  COMPONENTS  OF  SOFTWARE 

DSP25 

2.1  THE  QUADRATURE  PROCESSING  OF  REALIZATIONS. 

We  use  two  realizations  of  procedure  of  quadrature  restoration  from 
digitized  IF  data.  First  is  based  on  the  less-square  method.  Let  ,  k=l,...,  n  realization  of 
signal  on  IF.  We  designate: 
x=27tF0/Fd, 

A=sin(nx)cos((n-f- 1  )x)/sin(x), 

B=sin(nx)sin((n+ 1  )x)/sin(x), 

Where:  FO  -  average  fi-equency  of  signal  IF,  Fd  -  sampling  frequency. 

Then  the  following  estimation  of  the  quadrature  coitq)onents  As  and  Ac  takes  place; 
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Ac  =  2sm^ 
As  =  2  sin^ 


W 

w 


(n  -  A)[U,  mixY-.M,  cos(«it)}-  £{^1  w{x)+...+U,  sm(>K)} 
-  sm(x))^  -  sin^  (m) 

(n  +  A){U,  sin(x)+...-R/,  sin(m))-  B{C/i  cos(>:)+...4j7,  cos(kx)} 
(nsm(3c))^  -siji^(«x) 


Here  n  -  number  of  data,  on  whicli  are  evaluated  "quadratures"  Ac  and  As.  We  used  n  16, 
at  which  the  received  estimation  of  signal  of  constant  level  has  the  error  not  more  than 

0.5  %.  ^  f 

Second  variant  of  procedure  uses  only  two  samples  of  ADC  for  estimation  ot 
quadratures.  Except  really  measured  data  the  special  precalculated  array  of  reference 
frequency  IF  is  used.  The  defect  of  such  algorithm  is  the  loss  of  absolute  value  of  phase  of 
received  signal,  but  the  amount  of  calculations  is  approximately  in  five  times  reduced. 

2.2  AMPLITUDE/PHASE  DETECTION  OF  SIGNAL. 

The  detection  of  anqilitudes  and  phases  of  signals  is  proceeded  similarly  to  the  procedure, 

accepted  in  signal  processor  "MOST"  [1].  ^  ^  • 

First  we  determine  the  maximal  element  from  As  and  Ac.  Tlie  binary  value  of  fraction 
Amin  /  Amax  is  the  argument  of  table  fimction; 


After  tabulated  Log  transformation  of  Amax  general  resulting  value  of  signal  anqilitude  can 
be  received  with  use  of  inverse  tabulated  exponential  transformation. 

The  fraction  Amin/Amax  also  in  aggregate  with  conqilex  of  logic  conditions  is  used  for 
tabulated  determination  of  signal  phase. 

2.3  CALCULATION  OF  SIGNAL  THRESHOLD. 

The  procedure  is  based  on  calculation  of  average  anqilitude  of  signal  Am,  average 
deviation  Dm,  received  threshold  is  equal  Am+kDm,  where  k  is  given  by  PC  processor. 

2.4  THE  ESTIMATION  OF  FACTOR  OF  AMPLIFICATION  ON  CALCULATED  AMPLITUDES 

The  realized  procedure  is  similar  to  one  used  in  [2],  except  for  other  levels  of  thresholds  of 
overflow  counters.  This  levels  are:  8,  16,. ..,1024,  1984.  The  coefficient  of  receiver 
amplification  is  calculated  with  the  st^  6  dB. 

2.5  THE  FILTERING  OF  SIGNAL. 

Given  realization  has  the  opportunity  to  make  the  operation  "  Sliding  average  ’’  on  16 
points  of  amphtude  of  signal  as  before  accumulation,  as  afterwards.  The  "triangular 
FILTER"  is  realized  on  3  points.  It  is  used  recursive  procedure: 

u,  =  {u  2U ,+ U 

so  here  it  does  not  arise  signal  time  shift. 


471 


2.6  SEARCH  OF  USEFUL  SIGNALS. 


Hie  selection  of  useful  signals  in  realization  is  proceeded  using  calculated  threshold.  In 
target  files  only  the  elements,  which  anq)htude  exceeds  the  threshold  level,  remain.  All  other 
are  filled  in  by  zero. 


3  RESULTS 

As  the  results  we  show  the  times  of  completion  of  operations  of  TMS320C25  hi 
milliseconds.  The  table  is  adduced  for  the  case  of  two  channels  with  1024  elements  in 
realization. 


Operation 

Time 

Quadratures 

18.5 

Amplitudes  +  phases 

29.75 

Amplitudes  only 

25.25 

Separation  of  the  0/x 
components 

2.25 

Calculation  of 
threshold  of  signal 

2.25 

Estimation  of 
amplification 

in 

Sliding  average 
(16  elements) 

2.75 

‘Triangular  filter 

2.25 

Amplitude  +  phase  + 
sliding  average  filter 

32.5 

Amplitude  +  phase  + 
‘triangular  filter 

32.5 

4.  CONCLUSIONS 

Our  experience  of  construction  of  system  of  signal  processing  on  the  basis  of  fixed-pomt 
processor  TMS320C25  has  shown  opportunities  and  restrictions  of  such  systems.  The 
performance  of  processor  permits  to  process  the  signals  of  ionosphere  sounding  in  real  time, 
with  pulses  repetition  rate  up  to  25  Hz.  At  the  same  time  the  standard  is  the  Ifrequency  of 
pulses  50  and  even  100  Hz.  The  resolving  in  such  situation  can  be  the  use  more  fast  signal 
processors  (for  example  TMS320C50)  or  processors  with  total  system  of  arithmetical 
operations.  The  most  limiting  factor  of  use  of  processor  TMS320C2*  is  the  absence  of 
command  of  integer  division. 
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Abstract 

Positioning  high  frequency  antennas  on  mobile  platforms  is  usually  a  trade-off  between  adaptability  and 
antenna  performance.  Harris  is  presently  analyzing  another  factor,  electromagnetic  safety  of  personnel  within 
or  near  the  platform.  The  goal  is  to  compare  measured  empirical  data  to  data  predictions  from  different 
numerical  models  to  determine  the  models’  validity.  Then  use  the  numerical  models  to  determine  the  ideal 
antenna  mounting  position.  The  initial  effort  that  this  paper  documents  details  a  militarized  M-998  HMMWV 
platform  with  a  1 0.7  meter  whip  antenna  mounted  on  the  left  rear  fender  using  a  typical  military  mount. 

Our  results  indicate  that  in  localized  areas  the  IEEE  established  thresholds  for  electromagnetic  field  strengths 
are  just  exceeded  for  the  M-998  HMMWV  with  125  watt  transmitter  using  a  fender  mounted  whip  antenna. 
These  localized  areas  around  the  HMMWV  exterior  are  small  compared  to  the  size  of  an  average  person. 
Therefore,  the  average  person’s  body  is  not  totally  immersed  in  electromagnetic  fields  that  exceed  the  IEEE 
established  thresholds.  The  personnel  within  the  HMMWV  are  not  exposed  to  levels  of  electromagnetic 
energy  above  the  IEEE  established  threshold.  All  the  results  are  for  transmission  of  continuous  wave  (CW) 
power;  therefore,  all  results  can  be  relaxed  depending  on  modulation  and  duty  cycle. 

Introduction 

Military  and  personal  communications  use  a  variety  of  wireless  equipment  to  send  messages  and  data  from  1 
km  to  half  way  around  the  world.  The  use  of  high  frequency  (HF)  communication  equipment  is  ideal  for 
communication  beyond  line-of-sight  via  ionosphere  reflection.  HF  communications  from  mobile  plafrorms 
require  tactical  equipment  using  an  omni-directional  antenna  to  provide  continuous  communication  while  the 
platform  is  moving.  Under  certain  conditions  it  is  possible  that  the  electromagnetic  fields  radiated  from  the 
antenna  could  be  potentially  harmful  to  the  radio  operator  as  well  as  other  personnel  in  the  vicinity.  This  paper 
discusses  numerical  predictions  and  empirical  validation  to  determine  the  electromagnetic  field  strengths  about 
the  platform’s  exterior  and  within  the  platform.  The  field  strength  levels  are  compared  to  preset  thresholds  to 
determine  if  a  personnel  safety  hazard  exists. 

Mavimum  Permissible  Exposure  of  Electromagnetic  Fields 

According  to  IEEE  standards,  safety  to  personnel  from  electromagnetic  fields  and  contact  with  conductive 
objects  exposed  to  such  fields  is  a  function  of  body  resistance  and  capacitance  to  ground.  Our  work  was 
confined  to  determine  the  electromagnetic  fields  associated  with  platform  mounted  HF  systems  and  not  to  set 
thresholds  on  health  hazards.  More  specifically,  we  were  attempting  to  determine  whether  the  empirical 
electromagnetic  field  data  agreed  with  our  numerical  models  for  specific  platform  and  antenna  configurations. 

The  current  IEEE  guideline'  provides  the  Maximum  Permissible  Exposure  (MPE)  in  terms  of  rms  electric  (E) 
and  magnetic  (H)  field  strengths,  power  density  (S),  and  the  induced  currents  (I)  in  the  human  body.  The  MPE 
to  electric  and  magnetic  fields  shown  in  Figure  1  are  for  the  equivalent  vertical  cross-sectional  area  of  the 
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human  body.  These  MPE  field  levels  are  set  to  limit  the  maximum  inducted  current  over  the  body  to  100  ma 
through  each  foot  to  ground.  Over  a  six  minute  time  frame,  this  results  in  about  144  J/kg  in  the  average  68  kg 
person.  This  is  equivalent  to  a  specific  absorption  ratio  (SAR)  of  about  0.4  W/kg  averaged  over  the  entire  body 
mass. 

All  thresholds  are  set  based  on  normal 
healthy  people.  People  with  cardiac 
pacemakers  and  other  medical  devices 
can  be  in  danger  with  electromagnetic 
levels  below  the  IEEE  thresholds. 

Munitions  and  other  electronically 
detonated  explosives  should  be 
adequately  shielded  from  all 
electromagnetic  sources. 

When  field  strengths  are  above  these 
levels  current  induced  on  the  platform 
can  cause  an  RF  shock  or  bum  hazard. 

The  relationship  between  the  electric 
field  in  V/m  and  the  current  is  a 
function  of  metallic  surface  exposed. 

Two  levels  are  defined  for  humans: 
perception-  feel  a  tingling  sensation 
and  let-go-  maximum  current  that  the 
a  muscle  can  be  stimulated  and  still 
voluntarily  release^.  Perception  current  is  50  ma  at  1  MHz  is  and  100  ma  at  30  MHz.  The  let-go  current  is  150 
ma  at  1  MHz  and  220  ma  at  30  MHz.  These  current  levels  can  be  induced  on  a  compact  car  immersed  in 
approximately  50  V/m  and  100  V/m  for  perception  and  let-go  at  10  MHz.  As  the  size  of  vehicle  increases  the 
field  values  are  reduced  to,  for  example,  15  V/m  and  35  V/m  for  a  truck.  Of  course,  perception  and  let-go 
currents  are  dependent  on  the  size  of  person  and  type  of  touch.  Conductive  contact  measurements  with  rod 
electrodes  and  humans  determined  that  the  RF  bum  threshold  from  2  to  20  MHz  is  approximately  200  ma^. 

Platform  and  HF  Equipment 

The  M-998  High  Mobility  Multi-purpose  Wheeled  Vehicle  (HMMWV)  is  the  typical  military  vehicle  used  for 
mobile  platforms.  This  platform  can  be  delivered  in  many  different  forms,  this  paper  discusses  measurements 
made  on  an  open  platform  with  a  soft-top.  Harris  is  presently  performing  measurements  on  a  closed  platform 
as  well  as  a  number  of  different  antennas. 

Harris  designed  a  tactical  HF  radio  system  for  the  HMMWV  and  other  highly  mobile  vehicles.  The  HF  radio 
system  (AN/GRC-231)  has  the  following  equipment  cascaded  together:  receiver/transmitter  (RF-5022),  power 
amplifier  (RF-5032)  and  an  antenna  coupler  (RF-382)  connected  to  a  narrowband  antenna.  On  our  test 
HMMWV,  the  radio  equipment  was  mounted  in  the  typical  location,  between  the  vehicle’s  rear  seats.  The 
antenna  coupler  was  located  above  and  inside  the  left  rear  fender  close  to  the  base  of  the  AT-101 1  (10.7  m. 
whip)  antenna.  The  radio  configuration  is  specific,  but  any  generic  125  watt  radio  system  with  the  identical 
antenna  and  mounted  should  produce  identical  electromagnetic  fields. 

The  equipment  was  configured  to  transmit  125  watts  of  unmodulated  continuous  wave  (CW)  power  into  the 
antenna  coupler.  Continuous  wave  mode  represents  the  worst  case  electromagnetic  field  exposure. 
Modulation  or  reduction  in  duty  cycle  will  reduce  the  electromagnetic  field  levels. 
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Theoretical  Attenuation  with  Distance 


Theoretical  derivation  of  the  near  and  intermediate  electromagnetic  fields  generated  from  a  whip  antenna 
attached  to  a  complex  metallic  structure  is  difficult  without  significant  simplifications.  The  electromagnetic 
fields  were  evaluated  using  numerical  methods  as  discussed  later.  It  is  useful  to  examine  the  rate  of  attenuation 
from  a  simple  infinitesimal  element.  In  free  space,  an  infinitesimal  current  element  with  zero  thickness 
oriented  along  the  z-axis  will  give  rise  to  the  following  field  vector  components  . 
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where: 

k  -IkI'K  -  wave  number 


-  intrinsic  impedance  of  free  space 


4  -  uniform  current  distribution  magnitude 
I  -  infinitesimal  current  element  length 
r  -  distance  to  observation  point  in  meters 
X  -  wavelength  in  meters 


The  two  orthogonal  electric  field  vector  components  attenuate  at  different  rates.  Initially  when  distance  from 
the  antenna  is  small,  kr«l,  Eq  and  are  both  inversely  proportional  to  r^  when  the  distance  is  increased  to 
kr>\,  Er  is  inversely  proportional  to  r^  whereas  Ee  is  inversely  proportional  to  r.  At  large  distance,  kr»\, 
only  the  remaining  Ee  and  components  make  up  the  propagating  wave. 


Our  analysis  is  concerned  with  near  and  intermediate  electromagnetic  field.  Based^on  the  simple  dipole 
attenuation,  our  data  should  fall-off  at  1/r^  close  to  the  antenna  then  at  a  mixture  of  1/r  and  1/r  depending  on 
the  magnitudes  of  the  individual  components.  Figure  2  shows  theoretical  curves  of  the  expected  attenuation 
versus  distance.  The  two  dashed  curves  show  attenuation  rates  for  different  initial  Ee  and  Ef.  The  solid  curve 
is  numerical  data  from  modeling  the  whip  and  platform  over  real  earth  at  2  MHz.  In  a  real  environment,  the 
magnitude  of  Ee  and  Brand  even  are  combinations  of  reflections  from  nearby  objects. 
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Numerical  Predictions 


The  Numerical  Electromagnetic  Code  Version  4^  (NEC-4)  is  a  well-known  implementation  of  the  moment 
method^  algorithm.  NEC-4  is  considered  the  "gold-standard"  electromagnetic  modeling  tool,  having  achieved 
this  status  through  constant  improvement  and  extensive  validation  performed  by  many  independent  users. 
NEC-4  computes  the  current  distribution  on  a  segmented  wire-grid  model  of  the  HMMWV  platform  with  the 
10.7  m  whip  antenna.  The  current  distribution  in  each  wire-grid  segment  is  then  used  to  compute  near  electric 
and  magnetic  field  strengths.  The  platform  is  electrically  close  to  the  ground  and  therefore  requires  the  use  of 
the  Sommerfeld-Norton  formulation  to 
correctly  mode!  the  interaction  at  the  earth- 
air  interface. 

The  antenna  and  conductive  vehicle  surface 
have  intense  interaction,  requiring  accurate 
numerical  modeling.  The  HMMWV  wire- 
grid  model  was  created  using  an  interactive 
graphical  user  interface,  EAMiNEC’. 

Figure  3  displays  the  resulting  numerical 
model  used  to  compute  the  electromagnetic 
fields.  A  soft-top  HMMWV  was  used  for 
testing  which  represents  the  worst  case 
exposure  to  the  passengers.  The  wire-grid 
model  contains  386  wire  segments  to  model 
the  entire  outer  conductive  surface 
including  the  undercarriage*.  The  wire-grid 
radius  and  spacing  were  adjusted  to 
correctly  model  a  solid  surface  as  defined  by  Ludwig’s  “twice  surface  area”  criteria^.  The  entire  model  is 
situated  such  that  the  10.7  meter  whip  is  aligned  with  the  z-axis  in  the  Cartesian  coordinate  system. 

Measurements 

To  approximate  realistic  conditions,  the  HMMWV  was  situated  in  a  large  open  paved  parking  lot  free  of 
metallic  obstacles.  Measurements  were  performed  in  linear  paths  away  from  the  platform  centered  on  the 
antenna.  Frequencies  were  measured  at  selected  allocated  frequencies  that  are  close  to  even  integers 
throughout  the  HF  band.  To  correctly  represent  the  true  operational  scenario,  the  HMMWV  battery  powered 
the  radio  system.  A  watt  meter  insured  that  125  Watts  of  power  was  delivered  to  the  antenna  coupler. 

The  electromagnetic  field  strengths  were  measured  using  several  small  probes.  A  number  of  probes  were  used: 
a  3.6  cm.  ball  probe,  a  calibrated  1  meter  long  probe  and  a  Holaday  3-axis  probe.  The  magnetic  field  was 
measured  with  an  3.0  cm.  loop,  a  10  cm.  loop  and  Holaday  3-axis  probes.  The  majority  of  the  tests  were 
measured  with  the  3.6  cm.  ball  and  3.0  cm.  loop  probes. 

Electromagnetic  fields  vary  along  the  body  length.  The  human  body  has  different  ^ecific  absorption  rates 
(SAR)  along  its  surface.  The  highest  SAR  levels  are  shown  to  be  on  the  neck'  .  Measurements  were 
performed  with  each  probe  on  a  non-metallic  tripod  at  165  cm.  height.  The  signal  received  on  these  probes 
were  connected  to  a  spectrum  analyzer  via  a  short  length  of  coaxial  cable.  The  tripod  was  moved  along  a  linear 
path  away  from  the  antenna  at  0.5,  1,  2, 4,  8,  and  16  meters. 

HMMWV  interior  electromagnetic  fields  were  measured  at  the  location  of  the  four  seats.  Two  points  were 
measured  at  each  seat  location;  waist  level  and  neck  level  in  the  seated  position. 
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The  time  involved  with  gathering  empirical  data  limits  the  amount  of  data  that  was  obtained.  The  empirical 
data  was  used  to  validate  and  verify  the  numerical  predictions.  All  results  displayed  compare  both  measured 
and  modeled  electric  and  magnetic  field  strengths. 


Exterior  Electric  Fields 

The  curves  in  Figure  4  show  the  measured  and  modeled  electric  field  strength  versus  distance  from  the 
antenna.  The  2  MHz  data  has  excellent  agreement  in  magnitude  and  attenuation  rate  versus  distance.  At  4 
MHz,  the  agreement  is  good  overall,  but  the  1/r^  reduction  rate  is  not  seen  in  the  measured  data.  At  8  MHz,  the 
measured  data  has  the  1/r^  reduction  rate  while  the  numerical  doesn’t,  but  overall  the  data  compares  well.  At 
16  MHz,  the  data  has  excellent  agreement  in  magnitude  and  attenuation  rate.  The  numerical  data  at  24  MHz 
disagrees  with  the  theoretical  attenuation  rate,  where  the  measured  data  agrees  with  the  theoretical  curve.  At 
30  MHz,  the  measured  and  modeled  data  attenuates  at  different  rates.  Overall  there  is  good  agreement  between 
the  measured  data  and  modeled  data,  particularly  at  2  and  16  MHz.  The  numerical  model  data  tracks  the 
measured  data  well  close  to  the  HMMWV  at  the  lower  frequencies,  but  above  24  MHz  the  attenuation  rate 
diverges.  All  the  field  strength  curves  in  Figure  4  are  below  the  IEEE  thresholds  (Figure  1).  These  curves 
represent  attenuation  in  one  linear  path  away  from  the  antenna  and  not  all  around  the  HMMWV. 


Figure  4.  Measured  and  Modeled  Electric  Field  Strength  Versus  Distance 

The  platform  and  the  antenna  mounting  lack  symmetry  resulting  in  asymmetric  electromagnetic  fields.  Figure 
5  shows  a  contour  plot  looking  down  on  the  platform  with  the  antenna  located  at  x  =  0  and  y  =  0.  The  plot 
shows  the  magnitude  of  the  electric  field  out  to  5  meters  at  a  1.65  meter  height  for  2  MHz.  This  figure  has  a 
contour  line  ever>'  100  V/m  starting  with  levels  above  the  IEEE  threshold  of  614  V/m.  The  rectangular 
HMMWV  influence  on  field  strength  is  clearly  evident.  Threshold  levels  are  exceeded  around  the  rear  of  the 
platform  and  up  the  driver’s  side.  If  the  driver  has  his  arm  out  the  window  the  arm  would  be  exposed  to  levels 
that  exceed  the  IEEE  threshold.  The  electric  field  levels  attenuate  outwardly  at  different  rate  due  the  off  center 
antenna  mounting.  This  figure  reveals  that  an  electric  field  null  exists  within  the  passenger  space. 

The  strongest  electromagnetic  fields  are  at  the  HMMWV  rear  surface  edges,  they  exceed  the  IEEE  thresholds 
shown  in  Figure  1.  The  size  of  the  exceeded  volume  is  not  large  enough  for  a  person  to  be  confined  within. 
Therefore,  at  2  MHz  the  worst  case  condition  does  not  constitute  an  electromagnetic  danger.  The  driver  is  not 


homogeneously  exposed  to  this 
level  so  at  most  only  a  body 
portion  is  exposed.  Localized 
SAR  of  8  W/kg  averaged  over 
one  gram  of  body  mass  is 
deemed  the  localized  threshold. 
This  level  is  20  times  the  level 
for  homogenous  exposure,  or 
approximately  4.5  times  the 
electric  field.  These  localized 
levels  only  just  exceeded  MPE 
levels  and  are  below  4.5  times 
the  614  V/m  level. 

Similar  testing  was  performed 
using  the  AN/GRC-193  radio 
with  a  4.6  meter  vehicular  whip 
antenna.  This  system  can 
produce  power  density  levels  in 
excess  of  1 .0  mW/cm^  to  a 
distance  of  1.1  m  from  the 
antenna  with  100  w  of  power". 

Exterior  Magnetic  Fields 


Figure  5.  Electric  Field  versus  Distance,  Whip  Antenna  at  (0,0) 


Figure  6  shows  the  measured  and  modeled  magnetic  field  strength  levels.  The  agreement  between  measured 
and  model  magnetic  field  strengths  are  exceptionally  good.  The  16  MHz  data  shows  a  dip  a  2  meters  that  is 
seen  in  both  the  measured  and  modeled  data.  At  30  MHz,  the  initial  rise  in  field  strength  is  seen  with  both  the 
measured  and  modeled  data.  Both  24  and  30  MHz  curves  have  an  abrupt  oscillation  in  the  modeled  data  at  10 
meters  from  the  antenna.  Our  measured  data  is  at  8  and  16  meters,  so  the  oscillation  at  10  meters  can  not  be 
validated.  Overall  the  data  has  exceptional  agreement;  therefore  we  feel  that  these  oscillations  are  correct.  All 
these  curves  are  far  below  the  IEEE  thresholds  shown  in  Figure  1 . 
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Figure  6.  Measured  and  Modeled  Magnetic  Field  Strength  Versus  Distance 
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Interior  Electric  Fields 

Measurements  were  taken  within  the  HMMWV  at  each  of  the  four  seats.  In  each  seat,  two  data  points  were 
taken;  one  at  0.1  meters  above  the  seat  and  0.2  m  below  the  roof  It  was  found  that  there  is  a  difference  in 
driver's  side  and  passenger’s  side,  but  only  a  small  difference  in  height  and  front  or  back  seats.  Shown  in 
Figure  7  are  two  dashed 
curves  that  represent  the 
measured  data  on  the 
driver’s  side  and 
passenger’s  side  across 
the  frequency  band.  The 
numerical  data  shows 
driver’s  side  front  and 
back  seat  and  passenger’s 
side  front  and  back  seat. 

The  IEEE  threshold  is 
also  shown  as  the  solid 
line.  At  2  MHz,  the  data 
is  on  the  fringe  of 
exceeding  MPE.  As  the 
frequency  is  increased, 
the  measured  and 
modeled  data  drops  well 
below  threshold  levels  up 
to  16  MHz.  Above  16 
MHz,  the  electric  field 
levels  increase  to  the 
threshold  level  at  24 
MHz,  then  decrease. 

RF  Shock 

The  limited  ability  to  ground  a  mobile  platform  results  in  circulating  currents.  These  currents  often  are  high 
enough  to  cause  RF  bums  depending  on  the  antenna’s  input  impedance.  When  a  narrow-band  antenna  is 
excited  with  a  frequency  so  that  the  antenna  appears  electrically  short,  high  voltages  in  the  order  of  thousands 
of  volts  are  developed  at  the  coupler’s  output  port.  A  safety  hazard  exists  from  the  RF  current  present  on  the 
coupler’s  output  port,  interconnecting  wire,  and  antenna  base  anytime  the  HF  communications  system  is  in  the 
transmit  mode.  During  transmit  the  coupler’s  output  port,  interconnecting  coupler  wire,  or  the  antenna  base 
should  not  be  touched. 


•  ps-fr*modei 
A  pS'bK'tTiodel 

•  dr-fr*  model 
m  df-bH-model 
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Frequency  (MHz) 

Figure  7.  Measured  and  Modeled  Electric  Field  Versus  Frequency  Within  the  HMMWV 


Conclusions 

Harris  has  recently  analyzed  and  is  continuing  to  analyze  electromagnetic  safety  of  personnel  within  or  m 
proximity  of  mobile  platforms  with  tactical  HF  radios.  Our  goal  is  to  determine  if  IEEE  maximum  permissible 
exposure  thresholds  are  exceeded  around  different  antennas  on  different  platforms  versus  frequency.  The 
effort  involves  numerical  predictions  with  validation  and  verification  by  empirical  methods.  The  results 
documented  in  this  paper  are  for  a  militarized  M-998  HMMWV  platform  with  a  10.7  meter  whip  antenna 
mounted  on  the  left  rear  fender  using  a  typical  military  mount.  Any  change  of  platform,  radio,  antenna,  or 
mounting  may  not  produce  the  same  results. 
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Overall,  there  is  good  agreement  between  the  measured  data  and  modeled  data,  both  external  and  internal  to  the 
platform.  The  numerical  model  tracks  the  measured  data  well  close  to  the  HMMWV  at  the  lower  frequencies. 
Our  results  conclude  that  IEEE  MPE  thresholds  are  exceeded  for  localized  areas  around  the  external  surface  of 
the  M-998  with  125  watt  transmitter  using  a  fender  mounted  whip  antenna.  These  localized  areas 

around  the  HMMWV  are  small  compared  to  the  size  of  an  average  person.  Therefore,  the  average  person’s 
body  is  not  totally  immersed  in  electromagnetic  fields  that  exceed  the  IEEE  MPE  thresholds.  The  IEEE 
localized  thresholds  are  invoked  which  are  well  above  our  measured  field  strengths.  The  personnel  within  the 
HMMWV  are  not  exposed  to  levels  of  electromagnetic  energy  above  the  IEEE  MPE  threshold.  All  the  results 
are  for  CW  mode;  therefore,  even  the  small  localized  hot  areas  can  disappear  depending  on  modulation  and 
duty  cycle. 

To  remain  outside  any  localized  area  that  exceeds  the  IEEE  MPE  thresholds  of  electromagnetic  field  strengths 
in  CW  mode  a  safety  zone  contour  is  recommended.  The  safety  contour  for  a  10.7  meter  rear  fender  mounted 
whip  antenna  on  a  M-998  HMMWV  operating  at  125  watts  is  as  follows; 

A  distance  of  0.5  meters  away  from  the  HMMWV  conductive  surface  starting  at  the  front  of 
the  driver's  side  door  proceeding  towards  the  antenna  to  the  HMMWV  left  rear  comer,  along 
the  back  to  the  right  rear  comer,  and  back  up  to  the  rear  of  the  passenger’s  side  door.  At 
distances  greater  than  0.5  meters,  all  electromagnetic  field  levels  are  below  the  IEEE  MPE 
thresholds. 

During  HF  transmission,  as  with  any  radio,  an  RF  shock  hazard  exists  if  physical  contact  is  made  with  the 
antenna  coupler’s  output  port,  coupler’s  interconnect  cable,  or  antenna  base. 
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Abstract 


The  wave-plasma  interaction  is  considered  with  taken  into  account  the  thermal  con¬ 
ductivity.  It  is  proposed  the  computational  algorithm  to  solve  the  nonlinear  problem.  It 
is  shown  that,  the  thermal  nonlinearity  creates  additional  duct.  This  interaction  makes 
the  wave  beam  trajectory  less  steep. 

1.  Intro  duct  ion 

The  problem  of  nonlinear  wave  plasma  interaction  appears  when  a  powerful  EM  wave 
propagates  in  plasma  medium.  Usually  such  interaction  is  described  by  means  of  the  non¬ 
linear  wave  equation  [1]  or  some  approximation  such  as  parabolic  equation  of  diffraction 
theory  [2].  However,  such  description  is  possible  only  in  the  case  of  negligible  thermal 
conductivity.  To  take  into  account  the  thermal  losses,  it  is  necessary  to  include  another 
equation  describing  the  electron  density  disturbance  initiated  by  the  powerful  electric 
field. 


2.  Analysis 


We  analyze  a  problem  of  intensive  wave  beam  propagation  in  the  ionosphere.  The 
small  region  is  considered  near  the  reflection  point  where  the  interaction  of  the  beam 
with  the  ionospheric  plasma  is  the  most  effective.  We  represent  the  wave  electric  field  in 
the  form  of  E  (.r,  y)  =  U  {x,  y)  ■  exp  {iKx),  where  x,  y  are  Cartesian  coordinates,  K  is  the 
wave  number.  For  the  wave  amplitude  U  we  can  write  down  the  parabolic  equation  [3] 


£-  being  the  dielectric  permittivity.  The  electric  field  heats  the  plasma  causing  a 
disturbance  of  the  electron  temperature.  The  distribution  of  the  electron  temperature  is 
described  by  the  heat  equation  across  the  wave  propagation  direction. 


L 


^d^AT 

dy"^ 


-  AT - 


•  To  =  0. 


Here,  AT  is  the  disturbance  of  the  electron  temperature.  To  is  the  background  electron 
temperature,  L  is  the  characteristic  scale  of  the  thermal  conduction,  Ej,  is  the  character¬ 
istic  plasma  field  {E^  =  STof m,  e  are  the  charge  and  the  mass  of  the  electron,  u 
is  the  radiation  frequency,  8  is  the  average  part  of  the  energy  which  the  electron  losses 
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at  one  collision.  Following  [4],  we  can  write  down  the  equation  for  the  electron  density 
diffusion 


^  d^AN  ,  d^AT  ^ 

Tq  j'  j. - 1-  kxNo — r-:;:-  =  0 


dj/2 


dj/2 


Here,  kj  is  the  coefficient  of  thermal  diffusion,  AN  is  the  disturbance  of  the  electron 
density,  A^o  is  the  not  perturbed  electron  density.  Dielectric  permittivity  e  is  connected 
with  the  electron  density  N  =  Nq  +  AN  by  the  relation 


e  = 


47re2 
- jA 


Thus,  we  can  easily  obtain  the  equation  of  diffusion  across  the  direction  of  the  wave 
propagation  for  the  dielectric  permittivity  perturbation  Ae 


Ct  LAL  o 

-—-a-Ae- 0-\Vf  =  O 
dy^ 

Where  a  =  kjfL^  is  the  coefficient  of  thermal  losses,  /3  =  a—^No  is  the  coefficients  of 
nonlinearity.  Thus  we  have  a  set  of  two  equations  for  describing  nonlinear  interaction  of 
the  wave  field  with  the  ionospheric  environment.  We  can  assume  that  the  coefficients  of 
the  second  equation  do  not  vary  in  the  considered  region  because  this  region  is  sufficiently 
small.  In  this  case,  we  can  obtain  the  solution  of  this  equation  in  terms  of  an  integral 
representation 


As  =  h  J  \U{x,z)\^  exp  {-a \y  -  z\)dz 

“CO 

where  a  =  Var,  6  =  Pl{2a).  Thus  we  can  reduce  our  problem  to  one  integral  - 
differential  equation. 


OX  oy^ 


00 

£o  -  1  +  ^>  J  \U[x,z)\^  exp{~a\y  -  z\)dz 


C/’  =  0, 


where  So  =  £0(3^,  y)  -  is  the  background  dielectric  permittivity. 

This  representation  is  useful  for  analytical  investigation  but  for  numerical  procedure 
maybe,  it  is  more  convenient  to  use  the  system  of  two  equation. 

3.  Numerical  method 


Now,  we  can  replace  the  differential  operator  by  their  difference  analogue  and  using  the 
sweep  method  we  can  obtain  the  numerical  solution  of  the  nonlinear  problem.  However, 
the  variance  relation  of  the  difference  equation  is  significantly  distinguished  from  variance 
relation  of  the  differential  equation  for  higher  harmonics  [5].  This  fact  is  not  terrible 
for  linear  problems,  because  we  can  put  sufficient  number  of  points  of  the  difference 
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network  for  the  initial  date  approximation.  But  in  the  nonlinear  problem,  new  harmonics 
can  appear  in  the  process  of  the  propagation  and  we  can  not  anticipate  a  priory  the 
spectrum  limits  of  the  solution.  Therefore  we  have  modified  the  well  known  decomposition 
method  to  solve  the  our  problem.  The  construction  of  the  numerical  solution  on  the  step 
rj,x,+i  =  Xj  +  Ax  is  decomposed  into  two  stage-difFraction  step 


and  refraction  step 


2iK^  +  K^(e-  1)U2  =  0 

OX 

where  Ui(xj,y)  =  Uixj,y),  O^xj,  y)  =  D\ixj+i,y),  (/(xj+i,y)  =  C/2(xj+i,  y)  To  avoid 
the  difference  derivative  across  the  wave  propagation,  we  construct  some  difference  ana^ 
logue  of  the  generalized  solution  of  the  diffraction  equation.  For  this  aim,  we  designate 
V^.(y)  =  [/i(xj,y),  and  rewrite  the  first  step  equation  in  the  form 


2iK- 


Ax 


dy^  dy^ 


(1) 


Now,  we  consider  the  scalar  product  of  this  equation  on  a  sufficiently  smooth  function 
ip  which  is  satisfied  the  boundary  conditions.  Then  we  can  transfer  the  differentiation 
from  the  function  V  to  the  function  p  and  thus,  we  can  get  the  relation 


+ 


cPp 

dy^ 


=  0 


were  7  =  ^  is  the  complex-valued  constant.  Next,  we  approximate  the  functions 
p{y)  and  Vj{y)  by  its  difference  analogues  at  N  points  of  across  coordinate.  We  construct 
N  scalar  product  using  N  linearly  independent  functions  pn-  Thus,  we  obtain  the  set  of 
N  equation  for  the  definition  of  the  vector  v  j+i  which  is  the  difference^  analogue  of  the 
function  Vjj^i{y)  .Introducing  matrixes  A  and  B.  with  elements  +  ^1  -Pn-  and 

=  -  7  •  c/Jn  at  the  points  of  the  network  yra-,  we  can  write  down  the  relation 

~v  [a  ^  ■  V  j 

Thus,  it  is  enough  to  calculate  one  time  the  transition  matrix  ■  B)  and  then  to 
use  this  matrix  every  time  passing  from  layer  to  layer.  It  is  most  easily  to  calculate  the 
transition  matrix  if  the  eigenfunctions  of  the  differential  operator  are  used  as  the  base 
functions  Pn 


4.  Conclusion 


In  the  case  of  the  characteristic  scale  of  thermal  conduction  L  being  very  small,  the 
problem  can  be  reduced  to  the  well-known  nonlinear  parabolic  equation.  However,  if  this 
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scale  is  not  negligible,  a  numerical  procedure  is  to  be  used.  The  numerical  solution  of  this 
problem  allows  us  to  investigate  the  deformation  of  the  wave  form  solution  by  thermal 
diffusion.  For  the  ionosphere  the  nonlinearity  is  very  small  (/5  <<  1)  and,  therefore,  we 
can  use  the  perturbation  method.  At  the  reflection  point  the  wave  field  is  sufficiently 
well  described  by  the  oscillating  Airy  function  (fig-l).  But  the  disturbance  of  dielectric 
permittivity  is  described  by  a  smooth  function  (fig.2).  Thus  the  thermal  nonlinearity 
creates  some  additional  duct  for  the  wave  propagation  which  makes  the  trajectory  of 
the  beam  more  slanting.  As  result,  the  exit  angle  of  wave  from  the  ionosphere  is  not 
equal  to  the  entrance  angle  (fig. 3).  This  conclusion  is  in  a  very  good  agreement  with  the 
experimental  results  [2]. 
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1.  Introduction. 

Radio  frequency  (RF)  propagation  path  loss  through  trees  and  foliage  determines  mainly  the 
distance  of  radio  communication  in  forest.  Usually  path  loss  of  radio  waves  in  forests  is  far  more 
than  in  clear  air  and  depends  strongly  on  different  conditions  (moisture,  density  of  trees,  foliage, 
kind  of  wood).  By  estimations  the  attenuation  of  radio  waves  due  to  foliage  can  vary  at  400MHz 
from  100  dB/km  for  sparse  forest  to  1000  dB/km  for  very  dense  forest  [1],  Extremely  high  path 
loss  of  radio  waves  in  dense  forest  makes  radio  communication  there  almost  impossible.  Under 
certain  conditions  the  radio  waves  can  propagate  in  forest  at  far  more  distances  than  it  could 
be  expected  from  estimations.  For  such  conditions  the  attenuation  of  radio  waves  due  to  foliage 
can  be  40—50  dB/km  [1].  This  effect  can  be  explained  by  formation  of  lateral  radio  waves  in 
the  model  where  forest  is  described  as  dielectric  slab.  For  prediction  of  path  loss  for  different 
conditions  several  mathematical  models  can  be  used  [4],  [1]. 

In  present  paper  results  of  measurements  of  radio  wave  attenuation  in  the  forest  and  outside  of 
forest  are  presented.  Features  of  radio  wave  propagation  in  clear  air  outside  forest  and  in  forest  are 
compared  and  discussed.  Spectra  of  received  signals  are  presented.  The  results  of  measurements 
of  radio  wave  attenuation  are  compared  with  the  theoretical  estimations. 

2.  Experiment, 

Measurements  of  radio  wave  attenuation  were  performed  in  1995  in  Troitsk  forest  near  Moscow, 
Russia.  For  experiment  a  special  acquisition  system  was  elaborated.  Hardware  included  portable 
transmitter,  super  sensitive  receiver  and  a  personal  computer  with  16  bit  resolution  DAC/ADC 
digital  board  inside  it.  Available  software  allowed  to  select  one  of  four  discrete  gain  amplifier 
ranges  of  DAC/ADC  digital  board  and  change  the  gain  of  digital  board  amplifier  within  each  range 
continuously.  Additionally  there  was  an  opportunity  to  change  the  gain  of  receiver  amplifier  from 
2  to  10000  manually.  This  gave  very  wide  total  dynamical  range  of  acquisition  system.  During 
experiment  the  receiver  was  located  at  the  same  point  near  the  forest  while  the  transmitter  was 
moving  in  forest  in  transverse  direction  to  the  border  of  forest  and  then  in  opposite  direction 
outside  of  forest.  For  controlling  purposes  an  independent  voice  channel  in  different  frequency 
range  was  additionally  used.  The  positions  of  transmitter  were  exactly  determined.  Transmitter 
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(a)  short  distance  signal  from  forest  transmitter 


1 


(c)  long  distance  signal  from  forest  transmitter 


(d)  clear  air  long  distance  signal  from  outside  of  forest  transmitter 
Figure  1.  Waveforms  of  received  signals. 
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forest  transmitter 

Figure  2.  Spectra  of  received  signals. 
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worked  in  pulse  mode.  The  data  were  acquired  into  personal  computer,  digitized  and  saved  in 
binary  format.  For  each  record  the  information  about  gain  of  amplifiers  and  position  of  transmitter 
were  kept. 


3.  Characteristics  of  received  signals. 

Waveform  of  signal  from  transmitter  located  in  forest  at  short  distance  is  shown  in  Figure  1  (a). 
On  the  left  and  right  sides  is  the  same  signal.  Both  pictures  represent  the  beginning  of  transmitter 
pulse.  On  the  right  side  the  signal  is  shown  in  magnified  scope  to  reveal  the  detail  structure  of 
registered  signal.  The  transmitter  was  at  a  short  distance  and  the  level  of  signal  is  far  more  than 
the  level  of  noise.  So  the  signal  has  the  pure  sine  form.  The  spectrum  of  this  signal  is  shown 
in  Figure  2(a).  The  width  of  spectral  pick  in  Figure  2(a)  is  determined  mainly  by  the  length  of 
emitted  transmitter  pulses.  The  pulse  structure  of  emitted/received  signals  can  be  seen  in  Figure 
1(d).  Here  the  signal  from  transmitter  located  outside  of  forest  at  large  distance  is  represented.  In 
Figure  1(d)  several  pulses  are  shown.  Because  of  the  large  distance  the  level  of  noise  is  compared 
with  the  level  of  useful  signal.  So  the  form  of  pulses  in  Figure  1(d)  is  masked  by  noise. 

Signal  from  transmitter  located  in  forest  at  middle  distance  is  shown  in  Figure  1(c).  As  earlier 
on  the  left  and  right  side  is  the  same  signal  in  different  scales.  In  right  side  picture  it  is  seen  that 
signal  contains  small  component  of  high  frequency  noise.  This  component  grows  with  increasing 
of  distance  between  transmitter  and  receiver.  The  larger  distance  the  more  the  volume  of  noise 
component.  The  signal  from  transmitter  located  in  forest  at  large  distance  is  shown  in  Figure 
1(e).  High  frequency  noise  component  increases  and  becomes  compared  with  the  amplitude  of 
useful  signal.  The  spectra  of  signals  from  transmitter  located  in  forest  at  middle  and  long  distance 
are  shown  in  Figures  2(c),  2(e).  It  is  seen  from  comparison  of  right  sides  of  Figures  l(a)-T-l(d) 
that  noise  component  can  significantly  distort  the  form  of  received  signal. 

Analysis  of  signals  shows  that  noise  component  of  signals  differs  for  transmitter  located  in 
forest  and  outside  of  forest.  The  difference  is  understandable  from  comparison  of  right  sides  of 
Figures  1(c)  and  1(d),  where  in  the  same  scope  the  signals  from  transmitter  located  in  forest  and 
outside  of  forest  are  shown.  The  form  of  signal  received  from  outside  of  forest  transmitter  is  more 
pure  than  the  form  of  signals  received  from  the  forest. 

Although  the  spectrum  of  received  signal  with  noise  component  can  be  very  wide  the  spectral 
intensity  of  useful  signal  is  more  than  the  noise  component  in  frequency  range  of  useful  signal. 
This  allows  to  extract  useful  signal  from  total  signal  using  narrow  frequency  band  flittering.  The 
spectra  of  filtered  signals  from  transmitter  located  in  forest  at  middle  and  large  distances  are 
shown  in  Figures  2(d),  2(f).  It  is  seen  that  filtering  removes  from  signal  low  and  high  frequency 
noise  component.  To  reduce  the  influence  of  noise  component  on  final  results  all  signals  were 
passed  through  narrow  frequency  band  filter.  Obtained  data  were  statistically  processed  and 
average  amplitude  of  useful  signal  was  calculated. 


4.  Attenuation  of  radio  waves  in  forest. 

The  results  of  measurements  of  attenuation  of  radio  waves  are  shown  in  Figures  3,4,  where  are 
represented  the  plots  of  decreasing  with  distance  of  average  amplitude  of  received  signals  for  clear 
air  propagation  (Figure  3)  and  for  propagation  through  the  forest  (Figure  4).  The  amplitude 
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in  figures  is  proportional  to  the  strength  of  electrical  field  of  received  radio  waves.  In  figure  3 
the  plot  of  attenuation  of  radio  waves  in  clear  air  with  exponential  and  power  fits  is  represented. 
Similar  plots  for  the  case  when  transmitter  was  in  forest  are  shown  in  Figure  4.  The  behavior 
of  curves  in  both  pictures  is  similar.  The  amplitude  of  signal  reduces  sharply  with  distance. 
Numerical  analysis  of  experimental  data  gives  extra  134  dB/km  attenuation  of  radio  waves  in 
forest  in  comparison  with  clear  air  propagation. 

The  theoretical  rate  of  attenuation  of  radio  waves  passing  through  trees  can  be  evaluated  from 
the  expression  [1],  [2],  [3] 


Lp  = -20loge"2'^^  (4.1) 

where  Lp  -  loss  rate  in  dB/m,  e  =  2.71828,  u  -  conductivity  in  siemens,  /  -  frequency  in  MHz. 
The  family  of  curves  representing  path  loss  for  various  conductivities  is  available  in  [l].  Reversing 
(4.1)  gives  a  convenient  method  of  evaluation  of  conductivity  a  using  path  loss  data  [1].  Under 
conditions  of  experiment  the  evaluation  of  I/p  using  (4.1)  gives  value  Lp  =  152  dB/km.  This  is 
enough  good  agreement  with  the  value  obtained  from  the  experiment  because  conductivity  o  of 
forest  can  be  determined  rather  approximately. 


5.  Discussion. 

Experimental  data  show  strong  attenuation  of  radio  waves  in  forest.  The  distance  of  radio  com¬ 
munication  in  forest  for  condition  of  experiment  was  almost  in  2  times  less  than  for  clear  air 
propagation.  The  numerical  value  of  path  loss  obtained  from  experimental  data  is  slightly  less 
than  evaluated  for  propagation  through  forest.  From  other  side  this  value  is  far  more  than  for 
the  situation  when  the  lateral  wave  could  arise.  Numerical  analysis  of  asymptotes  of  curves  of 
radio  wave  attenuation  indicates  faster  than  for  lateral  wave  decreasing  of  wave  amplitude  with 
distance.  Formation  of  lateral  wave  in  forest  is  an  attractive  opportunity  to  increase  the  distance 
of  radio  communication.  Conditions  when  lateral  waves  could  arise  should  be  investigated  more 
thoroughly. 
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1.  Introduction 

Recently  microwave  techniques  incorporating  advances  in  signal  processing  have  been  developing 
rapidly.  In  particular  mapping  of  sub-surface  random  ground  and  detection  of  underground  objects  are 
important  applications.  As  a  consequence,  underground  radar  systems  which  display  the  scattering  field 
patterns  of  electromagnetic  waves  have  become  practical  [I]- [4].  However  due  to  randomness  and 
inhomogeneous  nature  of  the  underground  media,  it  is  very  dilTicult  to  analyse  their  characteristics 
accurately.  In  principle,  it  is  necessary  to  study  electromagnetic  wave  reflection  and  scattering  under 
various  conditions  of  underground  media. 

One  of  tlie  most  powerful  methods  of  computational  analysis  applicable  to  such  problems  is  the  Finite 
Difference  Time  Domain  (FDTD)  method.  In  this  paper  we  apply  FDTD  method  to  simulate  the  scattering 
characteristics  of  an  underground  media  consisting  of  an  air  gap  target  region  in  the  presence  of  randomly 
located  obstacles.  Numerical  results  are  given  for  various  cases  of  the  medium  properties  like  tire  medium 
electric  permittivity  and  the  number,  size  and  location  of  obstacles. 

2.  Analysis  Model 

The  two  dimensional  analysis  model  is  schematically  displayed  in  Fig.  1  and  a  general  FDTD  modelling 
in  Fig.2.  We  consider  a  rectangular  region  consisting  of  a  target  air-gap  and  obstacles  located  randomly 
relative  to  it.  The  figure  shows  that  the  origin  is  located  at  Uie  ground  level,  implying  that  die  mapping  is 
conducted  at  a  short  distance  above  die  ground.  Tlie  shapes  of  air-gap  as  well  as  obstacles  are  rectangles. 
Typical  numerical  values  considered  are:  2.56m  along  x-axis,  0.44m  above  the  ground,  2.56m  below 
ground  for  the  analysis  region.  The  size  of  the  air-gap  is  Im  x  0.5m  and  is  located  0.5m  below  die  ground. 

The  analysis  corresponds  to  illuniinadng  the  region  by  an  electromagnetic  monopulse  having  spado- 
temporal  distribution,  and  analysing  the  reflected  fields.  Tiis  can  be  pracdcally  done  by  placing  a  planar 
antenna  along  the  x-axis  above  die  ground.  The  lengUi  of  the  antenna  is  typically  0.3m  and  isplaced  0.1m 
above  die  ground.  Except  for  the  target  air-gap  and  the  obstacles  which  are  random,  the  medium  is 
considered  non-dispersive  and  homogeneous  wiUi  a  reladve  permitdvity  £^=4.0. 

3.  Source  Modelling 

Source  modelling  essenUally  consists  of  choosing  an  appropriate  current  distribudon  funedon 
corresponding  to  illuminaUon  of  the  analysis  region  by  a  pulse  wave  from  a  planar  antenna.  This  current 
distribution  is  the  source  term  in  the  Maxwell's  equadons  which  determine  the  reflected  and  scattered  fields 
along  with  the  boundary  conditions  on  die  region.  In  order  to  illuminate  a  wide  subsurface  region  of  0.3m 
long,  the  current  source  distribudon  is  characterized  by  sinusoidal  funedon  in  the  spadal  domain  (x- 
direction)  (Fig. 3),  and  by  a  Gaussian  funedon  in  the  dme  domain  (Fig.4). 

Asin^-^xjexpj-flj 

0  otherwise 

where  A,  n  ,  G  and  L  are  100,  30,  5nsec,  0.3m  respecdvely. 

4.  FD-TD  Method 

Characterisdes  of  electromagnedc  waves  are  expressed  by  Maxwell's  curl-equadons.  FD-TD  method 
proposed  by  K.S.Yee[5]  is  used,  in  which  these  pardal  differendal  equadons  are  approximated  by 
difference  equadons  first,  and  then  the  electric  and  magnedc  fields  are  calculated  mutually  from  inidal 
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values  at  every  incremental  Ume  At.  For  Uie  t\TO-diniensional  model  shown  Fig.  1,  and  Maxwell’s  curl* 
equations  are  transformed  to  die  following  difference  equations. 
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Where  p  and  q  are  space  coordinates,  n  is  a  time  step  number.  And  Ax=Az=As  and  At  denote  the  spacial 
increment  and  temporal  increment,  respectively.  These  increments  have  to  satisfy  computational  stability 
condition.  So  in  this  study,  we  set  As=0.0im,  At=0.02nsec. 

We  adopt  the  following  absorbing  boundary  conditions  due  to  G.Mur's[6],  in  order  to  replace  the  actual 
situation  of  infinite  underground  region  by  a  finite  sized  rectangular  region. 

(9) 

V  V  di) 

where  n  is  out  ward  pointing  normal  on  tlie  boundary.  And  v  is  propagation  velocity. 

So,  electric  field  Ey  on  tire  boundary  X=0  is 

£/{0,9)=  E/-’(1,9)+C3{E/(1,9)-E/-'(0,9)}  (10) 

where 

(11) 
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5.  Statistical  Representation  of  Random  Medium 

Statistically  a  random  medium  is  represented  by  a  multivariable  random  function  with  parameters 
representing  the  constants  and  variables  of  the  medium  including  the  obstacles  and  the  target  bodies  within 
the  region.  In  the  present  study  the  parameters  are  the  electric  permittivity,  size  and  location  of  the  obstacles 
which  are  random  variables.  Fourier  analysis  can  be  applied  to  this  statistical  representation  by  considering 
tlie  complexity  of  random  medium  as  an  input  data.  If  the  input  data  is  considered  as  a  two  dimensional 
random  function  eip,q h  where  die  integer  variablesp  and  q  correspond  to  spatial  and  temporal  coordinates 
in  physical  space,  the  Fourier  spectrum  that  formally  represents  the  complexity  of  random  medium  is  given 


Here  j  =  4^  and  M  and  //denote  tlie  matrix  size  of  the  random  data,  k  and  I  correspond  to  the  two 
dimensional  spatio-temporal  frequency  domain. 

This  spectrum  can  be  normalized  as  follows: 
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A  limiting  value  of  normalization  can  be  obtained  by  considering  an  appropriate  standard  random 
medium.  Thus  a  Gaussian  distribution  for  the  function  e(p,  q)  is  chosen  as  a  suitable  standard.  Then 

=  (Jt  =  0- ■M-1,/  =  0-  //-1)  (14) 

MN 

For  an  arbitrary  random  function  e{p,  q),  a  formal  estimation  of  the  randomness  can  be  given  by  the 
variance  Of  given  as  a  deviation  from  the  Gaussian  case 

o/  =  jTl{F.(k,l)-Ef  where  £  =  -^  (15) 

V  t=o  /=o  mJV 

Another  equivalent  statistical  representation  is  given  by  the  two  dimensional  auto-correlation  function 
defined  by 

Af-I  JV-! 

S(m,n)=  Z  Se(j:,y)£(^  +  m,y+n) 

=  F-'{[F(;:,0I^}  (16) 

where  F’  indicates  the  inverse  Fourier  transform. 

In  this  case  an  estimation  of  the  medium  randomness  can  be  obtained  from  the  Fourier  transform  of  the 
auto-correlation  function,  by  defining  1/fe  value  of  spectral  intensity  distribution  as  the  indicative  auto¬ 
correlation  value. 

Thus  using  Fourier  analysis  it  is  convenient  to  represent  statistically  various  important  features  of  the 
random  medium  such  as  the  electrical  permittivity  of  the  obstacles,  their  size,  spacing  and  so  on. 

6.  Analysis  Results 

As  specific  examples  of  analysis,  we  consider  a  ground  medium  of  relative  electric  permittivity  £^=4, 
conductivity  ag=  1 0‘'[s/m],  permeability  and 

case  1:  N  =  400  obstacles  each  of  size  d  <  10cm  and  £,<  8. 

case  2:  N  =  400  obstacles  each  of  size  d  <  10cm  and  £^<32. 

case  3:  N  =  200  obstacles  each  of  size  d  <  10cm  and  £,<  8. 

The  models  considered  are  shown  inFigs.5-7  with  their  corresponding  parameters  displayed  in  Tables  1-3. 
The  results  of  FDTD  analysis  are  shown  in  Figs. 8- 10.  Fourier  power  spectrum,  auto-correlation  function, 
spatial  and  temporal  distribution  of  received  signals  are  shown.  We  can  understand  various  features  from 
these  figures. 

As  with  the  Fourier  Power  spectrum  of  the  input  data,  observing  Fig. 8(a)  and  Fig.9(a),  we  see  that  the 
variance  Of  drops  to  very  low  value  (CTr:0.028^0.0078)  as  the  relative  electric  permittivity  £i  increases  four 
times  (£,:8-^32).  This  is  when  other  parameters  remain  same  i.e.,  for  N=:  400  obstacles  each  of  size  less  than 
d  <  10cm.  Observing  Fig.8(a)  and  Fig,  10(a),  where  the  number  of  obstacles  alone  is  halved  (N;400->200), 
a,  increases  (af:0.028-^0.040)  showing  a  reduced  randomness. 

Observing  the  auto-correlation  contour  plots  we  do  not  find  a  change  in  the  characteristics  when  there  is 
change  only  ine,  (Fig.8(b),  Fig.9(b)),  showing  its  independence  with  the  number  and  sizeof  the  obstacles. 
As  the  number  of  obstacles  reduces  (Fig.8(b)  and  Fig.  10(b))  we  observe  a  spread  showing  again  a  reduced 
randomness. 

Observing  the  time  response  of  the  received  signal  (Figs.8(f),9(f),  10(f))  we  confirm  that  detection  of  air- 
gap  is  easy  when  the  randomness  is  not  significant.  For  instance  we  can  see  the  peak  due  to  air-gap  clearly 
in  Fig. 10(f)  where  CTpO.040  is  high.  As  the  medium  becomes  more  random,  areduction  in  a„  it  is  difficult 
to  identify  the  air-gap  (Fig. 8(f)).  In  Fig.8(c-d)9(c-d),10(c-d),  we  display  three  dimensional  plots  of 
electromagnetic  fields  showing  the  phenomena  of  reflection  from  air-gap.  The  time  instants  are  chosen 
closer  to  be  around  the  reflection  phenomena.  We  are  unable  to  identify  the  air-gap  clearly  when  the 
medium  is  complicated. 

From  this  analysis  weconclude  that  itis  possible  to  understand  the  underground  conditions  statistically 
using  the  Fourier  analysis  of  the  medium  conditions  and  estimating  the  auto-correlation  of  the  medium. 
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7.  Conclusion  ^  ^  •  i 

Statistical  representation  of  a  random  medium  is  done,  and  the  relationship  between  the  received  signal 
and  the  medium  is  established.  The  analysis  may  be  further  developed  in  many  ways  like  increasing  the 
accuracy  and  speed  of  the  FDTD  algorithm,  extending  to  three  dimensional  cases,  incorporating  more 
randomness  and  parameters  and  considering  the  cases  much  closer  to  practical  conditions.  Improvements 
can  also  be  accomplished  through  more  sophisticated  signal  processing  of  the  received  signal  patterns. 
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Fig. 5  Underground  random  model:  case  1 
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Fig. 6  Underground  random  model:  case  2 
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Fig. 7  Underground  random  model:  case  3 
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Tables  Position  of  Obstacles(Case  3) 
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Fig.  8  Analysis  results  of  underground  random  model  in  the  case  L  e(p,g)  is  the 
relative  electric  permittivity  in  the  underground.  (c)~(e)  show  time 
response  of  Ey  in  the  analysis  region,  and  (f)  time  response  of  Ey  at  the 
observation  point(  1.28 ,0.1). 


Fig.  9  Analysis  results  of  underground  random  model  in  the  case  2.  z{p,q)  is  the 
relative  electric  permittivity  in  the  underground,  (c)'^(e)  show  time 
response  of  Ey  in  the  analysis  region,  and  (f)  time  response  of  Ey  at  the 
observation  point(  1.28, 0.1). 


(a)  Normalized  Fourier  power  spectrum 


(b)  Normalized  auto-correlation 
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Abstract 

A  hybrid  Method  of  Moments/Finite  Difference  Time  Domain  (MoM/FDTD)  approach  is  used  to 
characterize  UHF/VHF  propagation  over  irregular  terrain.  The  effect  of  terrain  on  path  loss  is  examined. 
Of  particular  interest  is  the  shadow  region  behind  the  hills.  The  standard  NEC-4  MoM  code  is  used  to 
determine  the  fields  propagated  by  the  antenna  over  flat  earth.  A  two  dimensional  FDTD  code  is  used  to 
calculate  the  fields  in  the  area  of  the  terrain  feature.  The  vertical  field  calculated  by  FDTD  is  converted  to 
path  loss  for  display. 

L  Introduction 

The  military  community  has  a  strong  interest  in  UHF/VHF  propagation.  To  enable  military  planners 
to  position  communication  resources,  a  knowledge  of  path  loss  over  irregular  terrain  must  be  obtained. 
With  recent  improvements  to  propagation  models  and  computers,  the  militaiy  planners  have  been  over¬ 
whelmed  with  solutions  to  this  problem.  These  improvements  have  made  it  possible  for  communication 
oflScers  to  quickly  calculate  the  path  loss  fi'om  several  models  to  use  as  inputs  to  planning.  Trade-offs  must 
be  made  in  the  selection  of  the  model  used  such  as  calculation  time  versus  accuracy.  With  limited 
theoretical  knowledge  in  the  area  of  electromagnetics,  and  the  significant  differences  in  the  answers  given 
by  various  models,  reliable  planning  becomes  veiy  difficult. 

When  considering  the  complexity  of  a  real  earth  environment,  it  is  apparent  that  even  the 
experienced  propagation  specialist  would  have  difficulty  based  only  on  the  information  supplied  by  the 
models.  After  considerable  research  into  the  theory  used  in  each  model,  one  can  begin  to  conjecture  on 
the  various  mechanisms  that  each  model  can  predict  accurately.  For  example,  a  particular  model  may  only 
calculate  reflections  and  not  diffraction.  Though  it  is  commonly  known  that  this  is  not  the  correct  solution, 
it  may  be  accurate  enough  for  a  particular  use.  The  problem  is  complicated  by  the  interactions  between 
individual  mechanisms. 

n.  Specific  Problem 

In  support  of  the  Radio  Frequency  Mission  Planner  (RFMP),  the  Naval  Information  Warfare 
Activity  (NIWA)  has  implemented  the  RFMP  Model  Characterization  Program.  The  RFMP  includes 
several  UHF/VHF  propagation  models  that  predict  terrain  effects.  These  models  are  based  on  diflferent 
theories  such  as  Geometric  Theory  of  Diffraction  (GTD)  [1],  knife-edge  diffraction  [2]  and  Parabolic 
Equation  (PE)  [3].  This  program  was  implemented  to  characterize  how  well  these  models  perform.  A 
generalized  measurement  campaign  would  be  costly  and  include  many  uncontrollable  variables.  Given  the 
computer  resources  on  hand.  Finite  Difference  Time  Domain  (FDTD)  was  a  viable  option  for  a  first  look 
at  simple  scenarios. 
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The  puipose  of  this  project  is  to  establish  a  rule  set  for  the  UHFAOT  propagation  models  and  to 
assist  in  developing  a  measurement  system.  The  path  loss  predicted  by  the  propagation  models  will  be 
compared  to  those  predicted  by  the  Method  of  Moments  /  Finite  Difference  Time  Domain  (MoM/FDTD) 
approach  and  color  contour  plots  showing  the  differences  will  be  presented.  In  other  words,  this  research 
provides  a  "yard  stick"  with  which  to  compare  the  propagation  models  to  determine  their  characteristics. 
That  yardstick  is  the  MoM/FDTD  approach. 

in.  The  MoM/FDTD  Approach 

The  hybrid  MoM/FDTD  approach  was  developed  to  characterize  UOT/VHF  propagation  models. 
FDTD  is  a  complete  solution  to  Maxwell’s  curl  equations  [4].  When  properly  implemented,  FDTD  includes 
the  effects  of  diffiaction,  reflections,  surface  waves,  and  all  others.  FDTD  has  been  shown  to  be  extremely 
accurate  for  scattering  calculations  [5].  The  unusual  application  in  this  case  involves  calculations  over  an 
infinite  half-plane  of  lossy  dielectric,  FDTD  by  itself  can  provide  very  good  results  for  small  propagahon 
problems.  Two  of  the  main  limitations  of  FDTD  are  speed  and  memory  requirements.  For  this  application, 
the  only  reasonable  solution  is  to  calculate  only  small  portions  of  the  area  with  FDTD.  The  fields  produced 
by  MoM,  which  take  into  account  both  the  antenna  pattern  and  the  effects  of  the  ground  plane,  are  used 
to  excite  the  FDTD  space.  The  FDTD  portion  of  the  code  then  calculates  the  electric  fields  in  the  area  of 
the  terrain  feature. 

rv.  Typical  Real  Scenario 

A  basic  set  of  scenarios  was  chosen  that  could  provide  good  terrain  characterization.  The  models 
are  compared  on  the  basis  of  path  loss  in  the  area  of  a  two-dimensional  cross  section  of  terrain.  The 
scenarios  consist  of  10  kilometer  flat  paths  between  the  transmitter  and  a  terrain  feature  (i.e.  hill).  The 
transmitter  is  a  vertical  dipole  placed  at  the  origin.  The  hill  is  150  meters  wide  and  100  meters  high 
(approximately  30  wavelengths).  The  hill  is  modeled  as  a  simple  wedge  or  gaussian  curve.  Double  hills 
are  also  examined.  The  ground  constants  are  those  of  average  earth;  relative  dielectric  constant  of  13  and 
conductivity  of 0.005  Siemens/meter.  The  ground  is  homogeneous.  The  skin  depth  is  approximately  3.8 
meters  and  the  loss  tangent  is  approximately  0.07  [6] . 

The  coordinate  system  is  defined  as  Mows,  The  radial  distance  from  the  antenna  will  be  along  the 
X-axis.  The  elevation  is  along  the  y-axis.  The  non-vaiying  coordinate  of  the  2-D  system  is  the  z-axis.  The 
ground  is  in  the  x-z  plane. 

V.  MoM  Implementation 

The  standard  Numerical  Electromagnetics  Code  (NEC)-4  is  a  widely  accepted  MoM  antenna 
modeling  code  [7,8].  It  is  used  here  to  calculate  the  fields  produced  by  a  vertical  antenna  over  an  infinite 
ground  plane.  A  half-wavelength  vertical  dipole  is  modeled  at  a  fi'equency  of  100  Megahertz  and  located 
one  wavelength  (3  meters)  above  ground.  The  ground  parameters  are  relative  dielectric  constant  of  13.0 
and  conductivity  of  0.005  Siemens/meter.  The  Sommerfeld/ Asymptotic  method  is  used  for  the  ^ound  to 
include  the  surface  wave.  An  excitation  of  1  watt  is  used.  The  Near  Electric  Field  (NE)  option  is  used  to 
calculate  the  electric  fields  along  a  vertical  line  10  kilometers  fi'om  the  transmit  antenna.  The  vertical 
spacing  of 2000  samples  is  40  per  wavelength  or  0.075  meters. 

The  magnitude  and  phase  of  the  incident  electrical  field  is  used  to  excite  the  FDTD  space.  The 
NEC  samples  are  located  at  the  center  of  each  vertical  FDTD  cell.  For  simple  antennas  near  the  ground 
and  far  from  the  FDTD  space,  a  single  vertical  slice  is  sufficient  since  the  horizontal  magnitude  variation 
of  the  incident  field  over  the  FDTD  area  is  small  (about  1/2  dB).  This  is  a  very  important  point.  If  the 
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radiation  from  the  antenna  is  not  propagating  primarily  along  the  ground  plane  this  method  may  not  work 
well.  The  assumption  that  the  magnitude  of  the  incident  field  has  very  little  variation  along  the  horizontal 
direction,  leads  to  this  simple  excitation.  If  the  fields  were  propagating  diagonally  across  the  FDTD 
calculation  space,  a  more  complex  incident  field  would  have  to  be  developed.  The  radial  variation  of  the 
phase  is  calculated  analytically  by  the  FDTD  code. 

The  NEC  code  is  also  us^  to  calculate  the  free-space  fields  in  the  area  of  interest.  This  is  an  area 
250  meters  horizontally  and  150  meters  vertically.  It  is  sampled  on  a  1  meter  by  1  meter  grid.  This 
information  is  used  later  along  with  the  FDTD  calculated  fields  to  produce  path  loss  [3]. 

VI.  FDTD  Implementation 

The  two-dimensional  FDTD  space  includes  a  terrain  feature  such  as  a  gaussian  hill.  The  source 
input  from  the  NEC  MoM  code  is  used  to  excite  the  FDTD  space.  This  will  require  a  steady-state  FDTD 
code  since  NEC  will  only  provide  fields  for  one  frequency. 

A  two-dimensional  FDTD  code  is  used  based  on  a  scattered  field  formulation  and  uses  a  Liao 
second  order  outer  radiation  boundary  condition  [4].  Several  areas  of  the  original  2-D  FDTD  code  have 
been  modified.  The  major  modification  involves  the  source  field.  The  FDTD  calculation  space  must  be 
excited  by  an  incident  field.  This  will  be  derived  from  MoM  calculations  over  flat  earth.  The  code 
ori^ally  used  an  exponentially  ramped  steady-state  plane  wave  with  an  specified  angle  of  incidence.  This 
was  modified  to  read  in  an  input  file  of  magnitude  and  phase  along  the  y-axis  and  propagated  in  the  x 
direction.  This  input  is  linearly  ramped  in  magnitude  and  propagated  from  outside  the  FDTD  space  in  a 
similar  manner  as  the  original  source.  This  method  of  introducing  the  source  reduces  transient  fields,  but 
significantly  increases  calculation  speed.  The  code  was  also  hard-coded  for  propagation  along  the  ground 
plane  or  x-axis.  Subroutines  have  been  added  to  sample  the  x  and  y  components  of  the  electric  field.  All 
FDTD  calculations  are  transverse  electric  (TE).  The  field  components  included  are  Ex,  Ey  and  Hz.  Square 
cells  are  used. 

A  ten  cell  layer  of  real  earth  cells  are  placed  along  the  lower  edge  of  the  FDTD  space.  This 
basically  nullifies  the  lower  free  space  boundary  condition.  Due  to  the  fact  that  propagation  is  primarily 
in  the  x  direction  and  upward  from  the  hill,  this  provides  an  adequate  lower  boundary.  This  method  worked 
very  well  and  was  simple  to  do.  An  alternate  method  of  redefining  the  lower  and  comer  boundary 
conditions  was  not  found  to  be  necessary  at  this  time. 

The  terrain  feature  or  hill  is  built  of  real  earth  cells  on  top  of  this  ground  plane.  A  secondary  hill 
is  built  inside  the  first  hill.  This  hill  is  built  of  Perfect  Electric  Conductor  (PEC)  material  and  is  kept  at  least 
two  sldn  depths  inside  the  first  hill.  Since  the  fields  at  this  point  inside  the  real  earth  hill  have  dropped  by 
about  20  dB,  the  PEC  hill  has  little  effect  on  the  fields  in  free  space  but  greatly  reduces  calculation  time. 
This  reduction  in  time  is  due  to  the  simplicity  of  the  PEC  update  equation.  This  also  reduces  possible  error 
fields,  caused  by  grid  dispersion,  from  passing  through  the  hill.  The  overall  effect  of  this  is  similar  to  the 
use  of  a  Surface  Impedance  Boundary  Condition  (SIBC)  which  has  not  yet  been  implemented.  But  due 
to  the  size  of  the  calculation  space,  grid  dispersion  errors  are  significant  [9].  It  has  been  found  that  30-40 
cells  per  wavelength  are  required  to  reduce  this  error  sufficiently  and  obtain  good  results  in  the  shadow 
region.  Although  calculation  times  could  be  reduced,  little  memory  savings  would  be  gained  by 
implementing  a  SIBC,  since  40  cells  per  wavelength  are  required  for  these  ground  parameters. 

Current  computer  resources  consist  of  an  HP  9000/755  UNIX  workstation  with  128  megabytes  of 
memory.  Trial  FDTD  runs  indicate  a  40  MFLOP  calculation  rate.  The  maximum  terrain  feature  size  is 
determined  by  the  frequency  and  computer  memory.  Calculations  are  conducted  at  100  megahertz.  Using 
typical  ground  constants  of  13  for  relative  dielectric  constant  and  0.005  Siemens/meter  for  conducti\aty, 


504 


requires  a  cell  size  of 0.075  meters  (40  cells  per  wavelength).  This  allows  approximately  seven  million  cells 
in  memory.  Initial  calculations  show  that  good  results  can  be  obtained  by  running  the  FDTD  code  for  three 
times  the  number  of  time  steps  as  there  are  horizontal  cells.  This  will  allow  a  calculation  space  of  45,000 
square  meters  (150  meters  by  250  meters)  and  require  approximately  33  hours  to  run.  The  original  code 
required  78  hours  for  this  c^culation. 

Due  to  the  size  and  number  of  cells  used,  3000  horizontally  by  2000  vertically,  sampling  of  all  cells 
was  not  required  for  good  resolution  plots.  Therefore  the  fields  are  sampled  on  a  1  meter  by  1  meter  grid. 
The  samples  are  taken  at  two  time  intervals  near  the  end  of  the  FDTD  run  and  one-quarter  time  cycle  apart. 
This  data  is  written  to  an  external  file.  These  fields  are  then  processed  to  determine  the  magnitude  and 
phase  of  the  steady-state  field  and  the  path  loss.  This  provides  a  plot  of  250  by  150  points.  Since  the 
objects  are  very  large,  stair  casing  is  negligible.  The  staircasing  of  the  terrain  appears  coarser  in  the  plots 
than  the  actual  FDTD  calculation  space. 

Vn.  Results 

The  FDTD  code  was  tested  against  the  exact  solution  [6]  for  a  2-D  cylinder.  The  cylinder  had  a 
diameter  of  30  meters  (10  wavelengths  at  100  megahertz).  These  results  were  excellent.  It  was  also  shown 
that  a  half  cylinder  over  an  "infinite"  plane  will  also  give  correct  results.  This  shows  that  some  special 
problems  can  be  run  with  FDTD  using  a  half-space. 

The  FDTD  code  was  also  tested  against  the  Numerical  Electromagnetic  Code  -  Basic  Scattering 
Code  (NECBSC),  Version  3  [10].  A  flat  plate  30  meters  (10  wavelengths  at  100  megahertz)  high  was 
illuminated  with  a  plane  wave.  Extremely  good  results  were  obtained.  Once  again,  half  the  plate  was 
placed  above  an  "infinite"  plate  with  very  good  results. 

The  next  test  case  mvolved  a  half-wave  vertical  dipole  placed  one  wavelength  above  a  ground  plane. 
The  antenna  was  placed  inside  the  FDTD  space.  The  space  was  60  meters  by  30  meters.  The  space 
included  a  15  meter  by  1 5  meter  PEC  wedge.  The  x  and  y  components  of  the  electric  field  were  compared 
on  an  interval  of  one  tenth  of  a  wavelength.  This  case  was  mn  with  NECBSC  and  FDTD.  The  results 
were  very  similar,  but  NECBSC  showed  very  distinct  shadow  and  reflection  boundaries  as  expected  from 
UTD  theory. 

Finally,  the  NEC  input  was  used.  The  same  half-wave  vertical  dipole  was  modeled  one  wavelength 
above  the  ground  and  10  kilometers  from  the  FDTD  calculation  space.  A  large  problem  space  of  250 
meters  by  150  meters  was  run  with  good  results.  The  first  hill  was  a  simple  wedge.  This  hill  was  run  with 
NECBSC  and  MoM/FDTD.  Both  PEC  and  real  earth  materials  were  used  and  only  subtle  differences 
appeared  with  different  materials.  The  comparisons  between  models  were  very  good.  The  major 
difference  was  in  the  shadow  region  where  MoM/FDTD  shows  an  interference  pattern  of +/-  3dB.  See 
Figure  1. 

A  single  real  earth  gaussian  hill  was  also  modeled.  This  lull  showed  very  similar  effects  from  the 
MoM/FDTD  result  for  wedge.  See  Figure  2.  This  was  expected  due  to  the  similarity  in  the  shapes.  This 
case  could  not  be  modeled  using  NECBSC  because  the  hill  was  made  up  of  multiple  plates  arid  Version  3 
of  NECBSC  does  not  include  multiple  diffractions.  Double  gaussian  hills  were  also  modeled  with  expected 
results. 

Vm.  Conclusions 

Several  sample  objects  have  been  modeled  using  the  2-D  FDTD  code  and  the  MoM/FDTD  hybrid 
approach.  These  objects  include  scatter  type  objects  (i.e.  cylinders  and  plates)  and  terrain  objects  (i.e. 
wedges  and  gaussian  hills).  The  FDTD  results  were  compared  against  NECBSC  and  exact  results.  These 
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results  show  extremely  good  comparisons.  It  has  been  shown  that  a  MoM/FDTD  hybrid  approach  can  be 
applied  to  UHFA'HF  terrain  propagation  problems  and  provide  extremely  detailed  and  accurate  results. 
This  is  definitely  not  the  end  of  this  area  of  work  but  just  the  beginning.  More  efficient  and  elegant  ways 
of  applying  these  methods  can  be  applied.  There  are  also  other  areas  of  applications  such  as  rough  surfaces 
and  inhomogeneous  materials. 

IX.  Acknowledgments 

Spedal  thanks  go  to  NIWA,  specifically  CDR  John  O'Dwyer,  for  sponsoring  this  research.  Also, 
thanks  go  to  my  advisor  at  Perm  State,  Dr.  James  Breakall,  and  my  supervisor  at  ARL,  Allan  Sonsteby. 
I  would  also  like  to  thank  Dr.  Raymond  Luebbers  of  Penn  State  for  his  assistance  and  supplying  the  original 
2-D  FDTD  code. 

James  Zmyslo  was  formerly  at  the  Naval  Information  Warfare  Activity,  3801  Nebraska  Avenue, 
NW,  Washington,  DC  20393. 

X.  References 

[1]  J.  Keller,  "Geometric  theory  of  diffraction,"  J.  Opt.  Soc.  of  Amer.,  52  ,116,  1962 

[2]  J.  Deygout,  "Multiple  Knife-Edge  Diffraction  of  Microwaves,"  IEEE  Trans.  Ant.  Prop.,  14,  480, 
1966 

[3]  F.  Ryan,  "User's  Guide  for  the  VTRPE  (Variable  Terrain  Radio  Parabolic  Equation)  Computer 
Model,"  Naval  Ocean  Systems  Center  (NOSC),  1991 

[4]  K.  Kunz,  R.  Luebbers,  Finite  Difference  Time  Domain  Method for  Electromagnetics,  CRC  Press, 
Boca  Raton,  1993 

[5]  R.  Luebbers,  K.  Lysiak,  M.  Husar,  S.  McClure,  "Validation  of  FDTD  Scattering  from  Realistic 
Targets;  Generic  Missile,"  1994 

[6]  C.  Balanis,  Advanced  Engineering  Electromagnetics,  Wiley,  New  York,  1989 

[7]  G.  Burke,  'Numerical  Electromagnetics  Code  -  NEC-4,  Method  ofMoments,  Part  I:  User's  Manual," 
Lawrence  Livermore  National  Laboratory,  1992 

[8]  G.  Burke,  "Numerical  Electromagnetics  Code  -  NEC-4,  Method  ofMoments,  Part  II;  Program 
Description  -  Theory,"  Lawrence  Livermore  National  Laboratory,  1992 

[9]  A  Taflove,  Computaional  Electrodynamics  -  The  Finite-Difference  Time-Domain  Method,  Artech 
House,  1995 

[10]  R.  Marhefka,  J,  Silvestro,  "Near  Zone  -  Basic  Scattering  Code  User's  Manual  with  Space  Station 
Applications,"  ElectroScience  Laboratory,  1989 


506 


Validation  of  FDTD  Modeling  of  Ground-Penetrating  Radar  Antennas 


Brian  J.  Zook 

Southwest  Research  Institute 
P.O.  Drawer  28510 
San  Antonio,  TX  78228-0510 
bzook@swri.edu 


Introduction 

Ground-penetrating  radar  (GPR)  is  commonly  used  for  detecting  buried  objects.  Antenna  design,  soil 
parameters,  and  target  characteristics  all  strongly  influence  the  effectiveness  of  any  GPR  system.  Until 
recently,  optimal  GPR  designs  required  extensive  measurements  and  design  iterations.  Modeling  offers  the 
promise  of  finding  quality  designs  before  the  expensive  construction  of  prototypes. 

We  are  using  the  finite-difference  time-domain  (FDTD)  method  to  model  GPR  antennas.  There  are  several 
advantages  to  using  FDTD  modeling  for  this  appUcations.  First,  FDTD  can  model  the  fields  inside  the  soil, 
which  is  a  dispersive  (and  dissipative)  dielectric.  Second,  the  general  and  flexible  nature  of  FDTD  allows 
us  to  model  complex  antennas,  targets  (conducting  and  non-conducting),  and  possible  soil  inhomogeneities 
in  a  single  model  [1],  A  third  advantage  is  that  we  can  find  the  broadband  response  of  a  system  by  using 
pulsed  excitation,  that  is,  with  a  single  FDTD  calculation.  Thus  we  can  model  pulsed  GPR  systems  directly, 
as  well  as  stepped-ffequency  and  swept-fi-equency  systems  using  post-processing.  We  are  using  the 
damped  Higdon’s  outer  radiation  boundaiy  condition,  which  can  model  dispersive  media,  including  the 
air/soil  interface  [2]. 

In  this  paper  we  present  some  validation  results  of  FDTD  models  of  GPR  antennas,  comparing  3D  FDTD 
models  with  NEC  results  and  analytic  results.  We  show  comparisons  of  the  actual  fields  (magnitude  and 
phase)  in  both  the  ground  and  the  air  over  a  broad  frequency  range.  Field  plots  are  our  primary  means  of 
comparison,  although  a  few  quantitative  measures  are  used. 

Vertical  InFinitesimal  Antenna  Over  the  Earth 

Few  GPR  antennas  have  an  analytic  field  solution.  An  exception  is  a  vertical  infinitesimal  antenna  over  a 
lossy  dielectric  earth.  King  derived  formulas  for  the  (near-field  and  far-field)  fields,  both  in  the  earth  and 
above  it,  produced  by  an  infinitesimal  dipole  located  over  the  earth  [3,4].  Simply  stated,  the  fields  are  the 
sum  of  three  contributions:  a  direct  wave,  an  image  wave,  and  a  lateral  (surface)  wave.  The  lateral  wave 
contribution  uses  a  numerical  integral.'^  The  fields  are  proportional  to  I*dl,  the  product  of  the  impressed 
driving  current  and  the  antenna  length.  We  calculate  the  fields  at  the  frequencies  of  interest,  using  the 
currents  found  in  the  NEC  model. 


■^The  lateral  wave  can  be  approximated  using  Fresnel  integrals  in  the  far-field,  but  we  are 
looking  at  near-field  effects  in  this  paper. 
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The  FDTD  model  of  the  infinitesimal  dipole  uses  a  single  FDTD  component  (E^)  driven  by  a  pulsed  source. 
The  fi-equency-domain  view  of  the  excitation  voltage  is  shown  in  Figure  1 .  There  is  reasonable  energy  over 
the  range  50-650  MHz;  we  can  perhaps  use  up  to  800  MHz.  The  cell  size  in  all  three  dimensions  was  1 
cm.  We  are  using  a  scattered-field  formulation  code  derived  fi-om  the  Penn.  State  code  [5].  We  are  not 
using  the  thin-wire  enhancement,  however,  since  we  do  not  use  it  in  our  modeling  of  larger  GPR  antennas. 
The  results  shown  in  this  paper  use  a  47x89x80  cell  space  running  for  3000  timesteps,  which  took  a  little 
over  7  hours  to  mn  on  a  Sun  SPARC2-class  machine.'^  The  lower  30  layers  of  the  FDTD  volume  was  filled 
with  a  dielectric  earth  having  the  properties  e=9  and  cr=0. 1  S/m,  The  center  of  the  dipole  was  4.5  cm  above 
the  earth,  coordinates  (0,0,4. 5). 

We  are  using  NEC4D,  the  double-precision  version  of  NEC-4,  since  we  are  modeling  very  short  dipoles. 
The  NEC  model  of  the  infinitesimal  dipole  uses  a  single  unloaded  segment,  excited  using  the  applied-field 
excitation,  sitting  over  the  lossy  earth,  which  is  modeled  using  the  Sommerfeld/ Asymptotic  approach.  The 
radius  was  3.2  mm,  which  we  found  empirically  to  yield  the  same  impedance  as  the  FDTD  model. 

We  compared  the  fields  at  24  locations,  12  in  the  air  and  12  in  the  ground.  In  general,  there  was  similar 
agreement  at  all  the  locations,  although  space  does  not  allow  us  to  show  all  the  results  here.  The 
magnitude  and  phase  comparisons  for  the  z-component  of  the  electric  field  in  the  air  at  (20,0,4.5)  are 
shown  in  Figures  2  and  3 .  The  magnitude  and  phase  comparisons  for  the  z-component  of  the  electric  field 
in  the  air  at  (20,0,-14.5)  are  shown  in  Figures  4  and  5.^^  The  basic  shapes  of  the  magnitude  plots  are  the 
same  for  the  different  methods,  although  the  actual  magnitudes  are  slightly  different.  We  believe  this  is  due 
to  the  different  approximations  being  used.  For  example,  the  analytic  solution  considers  the  current  to  be 
constant  along  the  dipole  while  NEC  uses  a  sinusoidal  current.  For  the  two  locations  shown,  the  FDTD 
and  analytic  solutions  agree  to  within  3 .8  dB  and  1 1  °  over  25-700  MHz. 

Vertical  Finite  Dipole  Over  the  Earth 

We  are  really  interested  in  usrng  FDTD  to  model  finite  antennas.  So  as  a  validation  test,  we  examined  the 
fields  fi-om  an  1 1-cm  long  vertical  center-driven  dipole.  The  center  of  the  dipole  was  located  9.5  cm  above 
the  earth.  This  was  modeled  as  1 1  cells  in  FDTD  and  1 1  segments  in  NEC  (again  with  3.2  mm  radius). 
For  the  analytic  formulation,  we  found  the  fields  as  the  superposition  of  1 1  infinitesimal  dipoles,  with  the 
currents  being  supplied  by  the  results  of  the  FDTD  model.  We  could  have  used  the  currents  from  the  NEC 
model,  but  the  radius  we  used  in  the  NEC  model  resulted  in  a  slightly  (10-15%)  higher  antenna  susceptance 
in  the  NEC  model  than  in  the  FDTD  model. 

Again,  we  compared  the  fields  in  several  locations,  only  a  few  of  which  we  show  here.  Figures  6  and  7 
show  the  magnitude  and  phase  comparisons  for  the  z-component  of  the  electric  field  at  (20,0,9.5).  The 
FDTD  and  analytic  methods  show  remarkable  agreement,  particularly  considering  we  are  using  the  analytic 


^We  probably  could  have  ended  the  runs  after  only  2000  timesteps 

^^The  field  locations  are  the  locations  of  the  sampled  FDTD  fields,  which  are  offset  by 
one-half  cell  in  the  staggered-grid  formulation  we  are  using. 


509 


method  in  a  somewhat  questionable  fashion.  The  NEC  result  is  vastly  different,  however.  Figures  8  and  9 
show  the  magnitude  and  phase  comparisons  for  the  z-component  of  the  electric  field  at  (20,0,-14.5).  Again 
the  same  observations  apply.  The  NEC  formulation  apparently  does  not  include  the  lateral  wave  when 
calculating  the  electric  field.  Indeed,  the  NEC-4  User's  Guide  contains  figures  showing  the  error  at  grazing 
incidence,  which  is  where  the  lateral  wave  contribution  is  more  important.  The  NEC  result  for  the  x- 
component  at  (20.5,0,-15),  shown  in  Figure  10,  is  much  closer  to  the  other  two  methods.  We  believe  this 
is  due  to  the  lateral  wave  contribution  being  less  important  for  the  x-component  at  that  position. 
Furthermore,  it  appears  that  NEC  is  properly  calculating  the  dipole  currents,  just  not  the  resultant  electric 
fields.  For  these  cases,  the  FDTD  and  analytic  solutions  agree  to  within  1 .4  dB  and  23  °  over  25-700  MHz. 

Conclusions 

We  have  compared  results  from  FDTD,  NEC,  and  an  analytic  solution  for  two  vertical  dipole  cases.  The 
results  show  the  FDTD  method  is  modeling  the  fields  produced  by  the  antennas  near  and  in  the  ground  with 
reasonable  accuracy,  certainly  sufficient  for  the  insight  we  need  as  we  design  GPR  systems.  One  lesson 
learned  in  this  study  was  that  NEC  will  not  be  as  helpful  as  we  had  hoped  in  studying  and  validating  more 
complex  antenna  models  for  GPR  applications  due  to  the  approximations  made  in  calculating  the  fields, 
although  these  approximations  are  accurate  for  most  applications  of  NEC. 
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Figure  1.  The  amplitude  spectrum  of  the  excitation  pulse  used  in  the  FDTD  runs.  FDTD 
results  were  divided  by  this  spectrum  to  get  the  frequency  response  shown  in  the  figures, 
as  if  the  antennas  were  driven  by  1  V/m  at  each  frequency. 
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Figure  2,  The  magnitude  of  the  Z-component  (vertical)  of  the  electric  field  20  cm  away  from 
the  infinitesimal  dipole  and  4.5  cm  above  the  ground  (same  height  as  the  center  of  the  dipole) 
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Figure  3.  The  phase  of  the  Z- component  of  the  electric  field  20  cm  away  fi"om  the 
infinitesimal  dipole  and  4.5  cm  above  the  ground  (same  height  as  the  center  of  the 
dipole).  The  NEC  and  Analytic  phase  curves  are  too  close  to  distinguish. 


Figure  4.  The  magnitude  of  the  Z-component  of  the  electric  field  20  cm  away  from  the 
infinitesimal  dipole  and  14.5  cm  below  the  ground  surface. 
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Figure  5.  The  phase  of  the  Z-component  of  the  electric  field  20  cm  away  from  the 
infinitesimal  dipole  and  14,5  cm  below  the  ground  surface. 
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Figure  6.  The  magnitude  of  the  Z-component  of  the  electric  field  20  cm  away  from  the  infinite 
dipole  and  9.5  cm  above  the  ground  (same  height  as  the  center  of  the  dipole).  Notice  the 
NEC  result  is  vastly  different  from  the  other  two  methods. 
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Figure  7.  The  phase  of  the  Z-component  of  the  electric  field  20  cm  away  fi-om  the  finite 
dipole  and  9.5  cm  above  the  ground  (same  height  as  the  center  of  the  dipole).  Notice 
the  NEC  result  is  vastly  different  from  the  other  two  methods 
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Figure  8,  The  magnitude  of  the  Z-component  of  the  electric  field  20  cm  away  from  the  finite 
dipole  and  14.5  cm  below  the  ground  surface.  Notice  the  NEC  result  is 
significantly  different  fi-om  the  other  two  methods. 
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Field  of  Finite  Dipole  -  E,(20,0,-14^ 
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Figure  9.  The  phase  of  the  Z-component  of  the  electric  field  20  cm  away  from  the  finite 
dipole  and  14.5  cm  below  the  ground  surface.  Notice  the  NEC  result  is 
significantly  different  from  the  other  two  methods. 


Field  of  Hnite  Dipole  -  £,(20.5,0,-15) 


Figure  10.  The  magnituu.  u.ic  ,^-component  (tionzontal)  of  the  electric  field  20.5  cm  away 
from  the  finite  dipole  and  15  cm  below  the  ground  surface.  Here  the  NEC  result  is 
in  reasonable  agreement  with  the  other  two  methods. 
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Abstract 

The  Finite  Difference  Time  Domain  (FDTD)  method  has  been  used  extensively  to  model 
the  radiation  characteristics  from  a  variety  of  antenna  structures.  In  this  paper  the  method 
is  applied  to  a  novel  conical  transverse  electromagnetic  (TEM)  antenna  which  has  a  dielectric 
lens  at  the  aperture  opening.  The  TEM  structure  is  chosen  due  to  its  ability  to  radiate  fast 
transient  pulses  and  is  constrained  by  a  lens  to  collimate  and  thus  increase  the  Foresight 
electric  field  strength  in  the  far  zone.  This  paper  describes  the  application  of  the  FDTD 
method  to  this  structure  and  compares  the  numerically  computed  results  to  those  obtained 
via  experimental  investigations  for  the  same  radiating  structure.  It  will  be  shown  that 
excellent  agreement  is  obtained. 


1  Introduction 

In  recent  years,  wideband  antennas  have  gained  in  popularity  due  to  their  ability  to  radiate  a 
broadband  of  energy  with  a  single  input  pulse.  In  an  effort  to  increase  the  frequency  response  of 
such  antennas,  researchers  have  been  utilizing  transient  driving  functions  with  faster  rise-times 
[1|.  However,  since  experimental  investigations  for  antenna  research  are  often  expensive  and  time- 
consuming,  computational  studies  are  used  to  aide  the  antenna  designers.  The  FDTD  method  is 
an  ideal  choice  for  wideband  antenna  simulations  since  all  field  quantities  are  calculated  in  the 
time-domain.  In  addition,  the  input  driving  function  is  analytically  expressed  as  a  time-domain 
parameter  which  allows  maximum  control  over  the  simulations.  Finally,  by  modeling  a  transient 
antenna  response  in  the  time-domain,  the  numerical  investigations  themselves  are  ideally  efficient 
w'hen  compared  to  traditional  frequency  domain  techniques. 

A\ith  the  preceding  in  mind,  the  FDTD  technique  has  been  applied  to  a  novel  transient  an¬ 
tenna  configuration  consisting  of  a  conical  TEM  with  a  lens  termination.  Initially,  this  research 
sought  a  determination  of  the  suitability  of  FDTD  for  modeling  fast  (sub  hundred  picosecond) 
transient  phenomena.  To  that  end,  the  FDTD  method  was  utilized  to  simulate  the  far  zone  ra¬ 
diated  response  from  the  conical  TEM  antenna  (on  Foresight)  and  comparisons  were  made  to 
existing  experimental  data  for  the  same  structure.  As  the  following  sections  show,  these  compar¬ 
isons  demonstrated  excellent  agreement  proving  FDTD  to  be  a  useful  tool  for  wideband  antenna 
research. 
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2  Conical  TEM  Antenna 

The  design  of  the  lens  constrained  conical  TEM  antenna  is  based  on  an  in-house  Phillips  Labora¬ 
tory  investigation  [2],  and  is  comprised  of  a  half-conical  TEM  structure  over  a  large  ground  plane 
with  a  lens  termination  at  the  aperture  (Figure  1). 


Side  View 


Lens 


Front  View 

Figure  1:  Schematic  of  conical  TEM  antenna  with  lens  at  aperture. 

The  lens  is  positioned  on  the  antenna  to  focus  the  far  zone  electromagnetic  fields  on  bore- 
sight  thereby  increasing  the  peak  radiated  field  strength.  This  focusing  allows  the  antenna  to 
minimize  the  leaky  characteristics  typically  found  with  “open’'  radiation  structures.  Made  from 
a  polyethylene  material,  the  lens  has  a  relative  index  of  refraction  of  Ur  -  1-517.  Physically,  the 
antenna  is  33.15  inches  long  by  24  inches  wide  (at  the  aperture  opening)  with  a  height  of  12.n6 
inches  with  respect  to  the  feed  region.  The  half  conical  angle  is  ^  =  23.14°.  The  large  conducting 
ground  plane  is  used  in  the  antenna’s  configuration  to  allow  simulation  of  a  full  conical  structure 
via  imaging  techniques. 

3  FDTD  Simulation  of  the  Conical  TEM  Antenna 

The  FDTD  method  utilizes  a  second  order  accurate  finite  difference  approximation  to  the  Maxwell 
curl  equations  in  the  time  domain  [3].  All  vector  field  quantities  are  calculated  on  a  uniform 
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spatial  grid  which  closely  approximates  the  physical  scatterer.  Therefore  to  maintain  accuracy 
in  the  simulations,  it  is  very  important  to  construct  an  accurate  mesh  so  that  the  simulations 
represent  as  closely  as  possible  the  real  antenna. 

The  operation  of  the  conical  TEM  antenna  relies  on  the  use  of  imaging  techniques  due  to  the 
large  ground  plane  present.  Unfortunately,  this  ground  plane  is  a  computationally  prohibitively 
large  since  it  requires  the  discretization  of  a  large  volume  of  free  space.  Thus,  when  modeling  an 
antenna  of  this  type,  it  is  better  to  construct  a  mesh  which  approximates  a  full  conical  description. 
The  electrical  characteristics  of  these  models  are  identical. 

The  mesh  created  for  the  conical  TEM  antenna  utilized  a  uniform  spatial  increment  of  ^  = 
=  Sy  =  6,  =  3.815  mm.  This  allowed  accurate  frequency  resolution  to  approximately  7.87  GHz 
according  to  the  modified  Nyquist  condition, 

(1) 

where  \rnin  corresponded  to  the  wavelength  of  the  highest  frequency  of  interest  in  the  simulation. 
Additionally,  this  cell  size  also  accommodated  all  physical  characteristics  of  the  antenna  including 
the  feed  region.  A  mesh  was  generated  according  to  this  spatial  increment  by  utilizing  a  Phillips 
Laboratory  automated  mesh  generation  tool.  The  resulting  mesh  contained  approximately  7 
million  cubic  cells  requiring  significant  computer  memory  and  disk  space  for  the  simulations.  The 
time  increment  was  determined  to  be  7.33  picoseconds  as  calculated  by  the  Courant  stability 
condition  for  cubic  cells  and  each  simulation  was  run  for  a  total  of  1024  time-steps. 

To  simulate  the  antenna’s  broadband  behavior,  a  unit  step  function  was  analytically  imposed 
on  the  grid  at  the  feed  region.  The  function  was  calculated  at  each  time-step  and  had  the  form 


Eij,k  =  1  -  exp 


(2) 


where  tq  was  a  constant  to  relate  the  numerical  driving  function  to  the  actual  pulse  rise  time. 
Adequate  numerical  simulation  of  the  antenna’s  driving  function  was  particularly  important  in 
this  investigation  since  the  conical  TEM  was  driven  with  a  step  function  whose  rise- time  was 
80  picoseconds.  In  order  to  accurately  model  the  antenna’s  transient  response,  there  had  to 
be  a  sufficient  number  of  time-steps  in  the  rising  edge  of  the  analytic  driving  function.  At  the 
spatial  and  temporal  discretization  utilized,  the  initial  ramp  of  Eq.  (2)  was  comprised  of  over  ten 
time-steps,  which  allowed  sufficient  temporal  resolution  of  the  driving  function. 

Figure  2  is  a  plot  of  the  antenna’s  source  function,  at  the  feed  region,  sampled  each  time- 
step  for  the  duration  of  the  simulation.  The  plot  shows  the  electric  field  to  rise  to  unity  over  80 
picoseconds  and  then  maintain  unit  amplitude  for  the  duration  of  the  simulation.  Additionally, 
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the  figure  clearly  indicates  that  the  driving  function  has  no  numerical  aberrations  such  as  flutter 

or  late  time  noise.  ,11  +  4.-  i 

In  the  FDTD  simulations,  Liao  second  order  boundary  conditions  enclosed  the  computational 

domain  at  a  distance  of  five  white  space  cells  from  the  edges  of  the  mesh.  This  allowed  all  spurious 
fields  to  be  absorbed  at  the  boundary  truncations  preventing  corruption  of  the  computed  antenna 

To  test  the  antenna’s  far  zone  radiation  characteristics,  a  near  to  far  zone  transformation  was 
utilized  to  determine  the  boresight  radiated  field  strength  [4].  The  FDTD  code  first  calculated 
all  the  electric  and  magnetic  fields  on  a  fictitious  “integration  surface”  surrounding  the  antenna. 
These  fields  then  determined  the  surface  electric  and  magnetic  current  distributions,  thereby 
allowing  numerical  calculation  of  the  far  field  radiation  integrals.  The  far  zone  transformation 
was  normalized  with  respect  to  distance  from  the  scatterer  so  the  computed  far  field  results  did 

not  these  include  distance  terms.  ^  •  n  1  i. 

Two  sets  of  simulations  were  performed  on  the  conical  TEM  antenna.  Initially,  the  antenna 
response  of  an  identical  conical  TEM  without  the  lens  was  calculated.  Secondly,  the  lens  was 
added  and  the  simulations  were  run  again  so  the  effects  of  the  dielectric  lens  could  be  compare 
to  the  case  with  no  lens. 
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Due  to  the  large  volume  of  the  computational  domain,  all  simulations  were  run  on  Sun  Mi¬ 
crosystems  SPARC  class  workstations.  Each  simulation  required  approximately  20  hours  of  com¬ 
pute  time  to  calculate  both  the  field  quantities  and  the  far  zone  transformations.  All  data  w^as 
post-processed  and  analyzed  utilizing  in-house  developed  Phillips  Laboratory*  FDTD  data  analysis 
tools. 


4  Results 

Since  one  of  the  primary  goals  of  this  research  w'as  to  determine  the  suitability  of  the  FDTD 
method  to  simulate  fast  transient  phenomena,  all  calculated  results  were  compared  to  experimental 
data  for  the  conical  TEM  antenna.  The  experiments  previously  performed  on  the  antenna  involved 
sampling  the  far  zone  radiated  electric  field  on  boresight  at  the  edge  of  the  ground  plane.  Thus, 
comparisons  w'ere  made  between  the  FDTD  simulated  and  experimentally  measured  antenna 
responses  for  both  the  lens  and  no  lens  cases  previously  mentioned. 


Figure  3;  Computed  far  zone  response  for  conical  TEM  antenna  wdth  and  without  lens. 
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Figure  3  is  an  overlap-  plot  of  tlie  computed  far  zone  antenna  responses  on  boresight  for  the 
conical  TEM  antenna  with  and  without  the  lens.  The  data  indicates  that  with  the  lens  present, 
the  field  amplitude  is  increased  b.y  approximately  a  factor  of  two  showing  that  the  lens  acts  to 
focus  the  electric  field.  Additionally,  there  is  an  obvious  time  delay  for  the  case  with  the  lens 
present  due  to  a  decrease  in  the  field's  phase  velocity  through  the  dielectric  media.  Figure  4  is 


Figure  4:  Measured  far  zone  response  for  conical  TEM  antenna  with  and  without  lens. 

a  similar  overlay  for  the  antenna’s  experimental  data.  It  shows  the  same  features  as  the  compu¬ 
tational  results  and  the  agreement  between  the  two  sets  of  data  is  excellent.  From  Figure  3,  the 
approximate  pulse-widths  with  and  without  the  lens  are  113  and  230  picoseconds,  respectively. 
This  compares  favorablv  with  the  experimental  values  of  120  and  260  picoseconds  for  the  lens  and 
no  lens  cases  shown  in  Figure  4.  The  resulting  average  error  between  the  computed  and  measured 
values  is  approximately  eight  percent. 


5  Conclusions 

This  paper  has  shown  the  application  of  the  FDTD  method  to  a  conical  TEM  antenna  with 
a  lens  termination.  The  computed  data  demonstrated  excellent  agreement  with  experimental 
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investigations.  Importantly,  this  work  has  also  shown  the  applicability  of  the  FDTD  method  to 
simulate  wideband  phenomena.  Provided  sufficient  temporal  resolution  existed  in  the  initial  ramp 
of  the  driving  function,  the  FDTD  method  accurately  computed  the  far  zone  antenna  response 
for  a  driving  function  with  an  80  picosecond  rise-time. 

Computational  simulations  of  antenna  phenomena,  as  these  results  show,  provide  traditional 
wideband  antenna  designers  an  efficient  way  to  augment  their  analysis  “tool- kits.”  By  performing 
numerical  investigations  in  conjunction  with  experimental  studies,  it  is  possible  to  reduce  the 
overall  time  and  monetary  costs  associated  with  the  antenna  design  and  test  processes. 
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ABSTRACT 

A  center-fed  radiating  electric  dipole  antenna  driven  from  various  excitation  sources  is  analyzed 
using  the  Finite-Difference  Time-Domain  technique  (FDTD).  The  sources  that  are  considered  are  a 
Gaussian  pulse,  a  modulated  Gaussian  pulse,  and  an  impulse.  The  antenna  is  modeled  with  a  source 
resistance.  Results  in  terms  of  the  magnitude  of  the  radiation  impedance  of  the  dipole  are  obtained  tor 
various  sources  and  a  comparison  with  the  Numerical  Electromagnetic  Code  (NEC)  results  is  made  to 
investigate  their  accuracy.  In  addition,  due  to  the  great  effect  of  the  antenna  radius  on  the  NEC  results, 
a  study  is  done  to  yield  a  close  approximation  of  the  wire  radius  based  on  the  FDTD  cell  dimensions. 


1.  INTRODUCTION 

The  Finite-Difference  Time-Domain  technique,  first  introduced  by  Yee  [1],  has  been  proven  to  be 
an  effective  and  convenient  tool  for  time-domain  analysis  of  various  electromagnetic  problems.  Although 
it  has  been  applied  to  numerous  scattering  and  penetration  problems,  its  application  to  problems  mvolvmg 
antenna  radiation  has  been  limited  [2-4].  In  addition,  its  effectiveness  is  somewhat  limited  by  the  choice 
of  the  excitation  source  used  to  drive  the  radiating  object. 


A  Gaussian  pulse  or  a  modulated  Gaussian  pulse  is  commonly  used  to  dnve  a  radiating  system. 
This  type  of  excitation  limits  the  frequency  domain  analysis  to  a  specific  frequency  range  provided  by  the 
source.  Moreover,  excessive  computer  time  and  raemoiy  are  required  if  a  broad  frequency  range  is 

desired. 


In  this  paper,  a  center-fed  electric  dipole  antenna  is  analyzed  using  the  FDTD  technique  [5].  The 
antenna  is  driven  by  various  sources  of  excitation  namely,  a  Gaussian  pulse,  a  modulated  Gaussian  pulse, 
and  an  impulse.  The  radiation  impedance  of  the  antenna  is  computed  and  compared  wiA  the  resiUts 
obtained  the  Method  of  Moments  (MoM)  via  NEC  [6].  In  addition,  In  order  to  comp^e  the  FDTD  resdts 
with  those  obtained  by  MoM,  a  study  is  done  to  determine  the  radius  of  the  dipole  required  for  NbL 
computations  based  on  the  FDTD  cell  dimensions.  The  analysis  of  the  antenna  is  earned  out  with  a  source 
resistance  of  50Q.  The  source  resistance  is  incorporated  in  the  original  FDTD  formulations  employing 
the  technique  described  in  [7]. 
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II.  ANALYSIS 


In  order  to  model  and  analyze  the  electric  dipole  antenna,  the  technique  presented  in  [8]  is 
employed.  A  wire  dipole  of  0.41  cm  length  is  considered.  The  problem  space  is  41x41x80  cells  with  the 
cell  dimensions  being  Ax  =  Ay  =  0.5  cm  and  Az  =  1 .0  cm.  This  results  in  a  time  step  of  At  =  1 1 . 1 1  ps  set 
at  the  current  limit.  The  FDTD  problem  space  is  shown  in  Figure  1 . 


Figure  1 ,  FDTD  problem  space  of  a  dipole  antenna  for  calculating  the 
radiation  impedance.  Dimensions  are  given  in  FDTD  cells, 
where  the  cell  size  is  0.5  x  0.5  x  1 .0  cm. 


To  model  the  dipole  antenna,  the  z-component  of  the  electric  field  is  set  to  zero  along  the  wire 
except  for  the  component  at  the  wire  center.  This  field  component  is  used  to  represent  the  excitation 
source  for  the  FDTD  calculations.  This  is  illustrated  in  Figure  2.  In  addition,  the  electric  field  given  by 
equation  (1)  is  modified  to  take  into  consideration  the  source  impedance.  This  leads  to  adding  the 
contribution  of  the  current  flowing  through  the  excitation  source  resistance  to  equation  (1).  The  technique 
is  clearly  outlined  in  [7]. 


524 


K- 


-1 

-  T 
-T 

t<_  vohage 
I  source 

-  I 

-  T 


Figure  2.  Excitation  of  the  dipole  antenna  using  the  electric  field 
component  on  the  wire  axis  at  the  center  of  the  wire. 


If  the  electric  field  at  the  center  of  the  antenna  is  defined  as  then 

r7,TTtr^-  -^(«M  ,  n=l,2, . ,nstop 


EiUX)  = 


where  V(nAt)  is  the  amplitude  of  the  excitation  source  and  nstop  is  the  maxim^  number  of  time  steps^^ 
The  current  I(t),  flowing  through  the  wire  center  can  be  sampled  by  taking  the  discrete  approximation  o 
lL"lf  the  x'and  y  components  of  the  magnetic  field,  H.  and  encircling  the  electric  field. 

Accordingly, 

with  the  values  of  indices  I.  J,  and  K  correspond  to  the  location  of  the  E/W.K)  electric  field  along  the 
wire  axis  at  the  center  of  the  antenna. 

In  order  to  perform  the  radiation  impedance  computations,  the  discrete  ^ 

voltages  at  the  antema  center  are  Fourier  transfoimed  except  for  the  case  of  the  impulse  excitation  which 
requires  only  the  discrete  frequency  results  of  the  current  values. 
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III.  RESULTS 


First,  the  dipole  antenna  is  fed  at  the  center  with  a  Gaussian  pulse  of  l.OV  maximum  amplitude 
of  the  form: 


V(t)  =  (3) 

with  p  =  64,  a  =  (4/(pAt))^  and  the  pulse  truncated  for  t  <  0  and  t  >  2pAt.  Second,  the  Gaussian  pulse  of 
equation  (3)  is  modified  and  used  as  the  excitation  source  so  that  a  modulated  Gaussian  pulse  is  obtained, 
i.e.. 


V(t)  =  cos(27c/o/) 


(4) 


where  the  center  frequency  fg  is  chosen  to  be  800  MHZ.  Finally,  an  impulse  is  applied  at  time  t  =  3At  to 
the  dipole’s  center  to  drive  the  radiating  antenna.  The  radiation  impedance  of  the  antenna  is  then 
computed  by  performing  a  Fourier  transform  analysis  to  the  time-domain  values  of  both  currents  and 
voltages,  obtained  from  the  FDTD,  at  the  feeding  point  of  the  dipole.  This  analysis  excludes  the  case 
where  the  antenna  is  driven  by  an  impulse,  where  the  frequency-domain  analysis  of  the  currents  is  only 
required  to  study  the  antenna  characteristics.  The  FDTD  computations  are  carried  out  for  nstop  =  8192 
time  steps  to  make  sure  that  all  transients  are  dissipated. 

For  comparison  purposes,  the  NEC  computer  code  is  employed  to  evaluate  the  antenna  radiation 
impedance.  However,  an  intensive  study  is  performed  to  determine  the  wire  radius  for  the  NEC 
computations  based  on  the  FDTD  cell  dimensions.  It  is  concluded  that  the  best  approximation  of  the 
radius  is  25%  of  the  FDTD  cell  dimensions.  The  FDTD  results  for  various  excitation  sources  along  with 
the  NEC  results  are  shown  in  Figure  3.  While  the  FDTD  calculations  are  valid  up  to  approximately  1.5 
GHz  based  on  having  20  FDTD  cells  per  wavelength,  the  thin  wire  approximation  for  the  MoM  code 
becomes  questionable  at  approximately  2.0  GHz.  As  seen  from  Figure  3,  the  FDTD  response  to  all  three 
sources  of  excitation  is  identical  and  in  fairly  close  agreement  with  the  NEC  results  even  up  to  2.0  GHz 
which  is  beyond  the  limitation  set  by  the  FDTD  technique. 


IV.  CONCLUSION 

It  is  shown  that  the  electric  dipole  antenna  can  be  effectively  analyzed  by  the  FDTD  technique 
when  driven  from  different  sources  of  excitation.  When  the  source  impedance  is  incorporated  in  the 
original  FDTD  formulations,  the  analysis  of  the  antenna  becomes  more  practical.  When  an  impulse 
excitation  is  used  to  drive  the  wire  dipole,  the  antenna  characteristics  can  be  studied  over  a  wide  frequency 
range  in  a  single  FDTD  code  execution.  Comparable  results  are  obtained  from  the  different  excitation 
sources.  However,  no  frequency-domain  analysis  of  the  voltage  source  is  required  to  obtain  the 
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impedance  and/or  admittance  of  the  antenna.  This  makes  the  antenna  analysis  more  efficient  and  simple. 


Frequency  (MHz) 


Figure  3.  Radiation  impedance  due  to  different  excitation  sources 
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Abstract:  Two  algorithms  are  combined  in  the  cylindrical  coordinates  for  accelerated  computations  of 
electromagnetic  fields  in  the  time  domain.  The  partial  eigenfunction  expansion  (PEE)  algorithm  is  used 
close  to  the  coordinate  system  origin  and  the  FDTD  algorithm  in  the  remaining  region.  This  eliminates 
the  use  of  a  prohibitively  short  time  step  if  high  circumferential  resolution  is  required  away  from  the 
origin.  A  significant  acceleration  of  computations  is  obtained,  due  to  the  superior  stability  and  higher 
efficiency  of  the  PEE  method. 

Introduction:  As  time  domain  techniques,  particularly  the  finite  difference  time  domain  (FDTD),  are 
increasingly  used  to  simulate  electromagnetic  fields  in  more  complex  and  larger  structures,  there  is  a 
need  for  new  algorithms  that  require  fewer  computer  resources  and  are  stable.  RecenUy,  an  efficient 
hybrid  PEE  (partial  eigenfunction  expansion)  -  FDTD  method  was  developed  for  shielded  structures  in 
the  Cartesian  coordinates  [1].  For  some  structures,  e.g.  a  corrugated  waveguide,  antenna  arrays  on  a 
cylinder  etc.,  it  is  advantageous  to  formulate  FDTD  in  cylindrical  coordinates.  But,  exceedingly  large 
computer  resources  are  required  when  a  high  circumferential  resolution  imposes  a  small  time  step.  The 
problems  associated  with  a  short  time  step  are  particularly  pronounced  at  low  frequencies,  where  time 
cycles  are  long.  The  solution  described  interfaces  two  methods:  the  PEE  and  the  classical  FDTD  in 
cylindrical  coordinates,  which  results  in  a  new  efficient  modeling  tool. 

Analysis:  The  geometry  of  the  structure  is  shown  in  Figure  1 .  The  PEE  algorithm  described  in  [1]  is 
used  inside  the  radius  where  the  field  components  are  expanded  in  a  Fourier  series  with  respect  to  0. 
A  typical  component  has  a  form: 

n=l 
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Between  Rj  and  an  FDTD  algorithm  in  cylindrical  coordinates  is  used.  The  transition  between 
the  two  algorithms  (interior  PEE  and  exterior  FDTD)  is  done  as  follows.  Fields  calculated  from  the 
PEE  on  the  surface  r  =  Rj  are  directly  imported  to  the  FDTD  algorithm.  In  the  opposite  direction,  the 
fields  given  by  the  FDTD  at  R,.+ Ar  are  used  to  obtain  the  expansion  coefficients  of  the  PEE  by  taking 
the  inner  product  of  each  function  [1].  In  cylindrical  coordinates  and  media  independent  of  (j)  this 
algorithm  resembles  the  PEE  -  FDTD  [2].  However,  there  are  two  important  differences  between  the 
two  algorithms.  The  PEE  computations  are  carried  with  several  expansion  terms  simultaneously,  while 
the  BOR-FDTD  considers  separately  each  mode.  Only  axi-symmetric  structures  can  be  analyzed  with 
BOR-FDTD  method,  while  the  hybrid  PEE-FDTD  method  can  be  applied  to  any  inhomogeneuity  in  the 
FDTD  region  of  the  computational  space. 

A  relative  stability  criterion  of  the  hybrid  method  compared  to  the  FDTD  can  be  derived  following 
the  method  given  in  [3]  as: 


^PEE 

^FDTD 


A  +  (A0)"' 
A  +  N^IA 


(2) 


where  A  is  a  constant  dependent  on  Ar  and  R; ,  and  N  is  the  number  of  terms  in  the  expansion  series. 
Thus,  the  smaller  the  A0  and  the  lower  the  number  of  terms  in  the  expansion  required,  the  greater  the 
increase  in  the  time  step.  Additionally,  the  dispersion  of  the  PEE  algorithm  is  inherently  lower,  because 
of  a  lack  of  discretization  in  0  direction,  so  more  accurate  results  are  to  be  expected. 

The  increase  in  speed  of  the  hybrid  method  as  compared  with  the  FDTD  is  due  to  two  factors:  (i) 
as  the  required  number  of  terms  in  the  expansion  is  usually  small,  a  larger  time  step  can  be  used 
resulting  in  fewer  computations,  and  (ii)  fewer  unknowns  are  used  to  evaluate  the  fields  close  to  the 
origin.  The  gain  in  speed  due  to  the  latter  factor  is  proportional  to  Rj  -  Rj )  for  a  large  number  of 
discretization  points  along  r  and  small  N.  The  overall  acceleration  is  a  product  of  both  factors,  and 
large  gains  in  speed  are  expected  for  large  Rj/R,,,^^-  The  gain  in  speed  is  also  accompanied  by  savings  in 
the  computer  memory. 


Numerical  Example:  To  test  the  hybrid  method  and  to  estimate  improvements  in  the  computation 
speed  and  accuracy,  the  cut-off  frequency  for  the  TM  modes  was  computed  with  Ar  =  R^^^  /23  and 
12,000  iterations.  The  interface  between  the  two  algorithms  was  at  R  =  12/23  Rj.  The  accuracy  of  the 
hybrid  and  FDTD  methods  is  compared  in  Table  1  for  different  number  of  discretization  points  in  0 
direction.  For  the  sake  of  brevity  only  the  results  for  the  TMj,  mode  are  given,  but  other  modes  with 
different  0  dependence  were  computed  simultaneously.  Figure  2  illustrates  the  effect  of  the  number  of 
terms  in  the  series  expansion.  As  the  number  of  expansion  terms  increases,  the  time  ratio  becomes 
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smaller,  however  it  remains  significantly  greater  than  1  for  all  cases  considered.  The  overall  speed  gain 
of  the  hybrid  method  with  both  factors  contributing  to  the  acceleration  considered  is  shown  in  Figure  3 . 
Higher  gains  are  achieved  as  the  area  covered  by  the  PEE  increases. 

Conclusions:  The  hybrid  PEE-FDTD  method  in  the  cylindrical  coordinates  provides  significant 
improvements  in  speed,  and  gains  in  memory  required  and  accuracy  in  analysis  of  quasi-symmetrical 
stractures  with  high  radial  resolution  for  large  radii.  This  method  is  particularly  attractive  for  structures 
that  are  homogeneous  close  to  the  z-axis,  but  have  subregions  that  need  to  be  modeled  with  high 
resolution  close  to  the  periphery. 
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TIME  STEP  RATIO 


Table  1.  Error  in  the  cut-off  frequency  for  the  TMj,  mode. 
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-0.02 

Fig.  1  Schematic  diagram  of  the  method  of  analysis 
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NUMBER  OF  POINTS  IN  PHI  DIRECTION 


Fig.  2  The  ratio  of  the  time  steps  of  the  hybrid 
and  the  FDTD  method  with  the  number  of  terms  in 
the  series  expansion  as  a  parameter  for  the  TM2, 
mode  (Ri=0.5R„,,). 


Fig.  3  Overall  speed  gain  for  various 
areas  in  which  PEE  algorithm  is 
applied  (N-2). 
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Abstract 

This  paper  will  present  results  in  computational  nonlinear  optics.  An  algorithm 
will  be  d^escribed  that  provides  absorbing  boundary  conditions  for  optical  pulses  in 
dispersive,  nonlinear  materials.  A  new  numerical  absorber  at  the  boundaries  has  been 
developed  that  is  responsive  to  the  spectral  content  of  the  pulse.  Currently,  ^o  ab¬ 
sorbing  boundary  conditions  exist  for  dispersive  materials;  rather  an  outer  re£on  of 
vacuum  must  be  added  and  then  absorbing  boundary  conditions  applied  to  the  vac¬ 
uum  region.  This  procedure  produces  reflections  at  the  boundary  between  regions. 
Absorbing  boundary  conditions  are  needed  for  dispersive  materials  m  applications  to 
both  linear  and  nonlinear  optical  device  simulations,  for  such  devices  as  wave  guides 
and  optical  fibers. 

Also,  results  will  be  shown  of  calculations  of  2-D  electromagnetic  no^inear  waves 
computed  by  directly  integrating  in  time  the  nonlmear  vector  Maxwell  s  equations^ 
The  results  will  include  simulations  of  “light  bullet  h^  pulses.  Here  diffraction  and 
dispersion  will  be  counteracted  by  nonlinear  effects.  Comparisons  will  be  shown  oi 
calculations  that  use  the  standard  boundary  conditions  and  the  new  ones. 
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1.  Introduction 

Transient  finite-difference,  finite-element,  and  finite-volume  methods  represent  powerful  tools 
for  the  analysis  of  complex  geometries.  The  finite-difference  time-domain  (FDTD)  method  in¬ 
troduced  by  Yee  nearly  30  years  ago  [1]  has  been  extensively  applied  to  a  wide  variety  of  prob¬ 
lems  that  range  from  microelectronics  to  biomedical  applications.  An  extensive  collection  of 
references  can  be  found  in  [2].  The  standard  FDTD  method  is  mature,  conceptually  simple,  and 
computationally  efficient  but  possesses  the  well-known  topological  limitation  of  staircasing  [3]. 
Finite-element  and  finite-volume  methods  provide  increased  geometrical  flexibility  by  conform¬ 
ing  elements  to  complex  features,  but  often  at  a  prohibitively  high  computational  cost.  Finite- 
element  methods  (FEM)  have  traditionally  been  applied  in  the  frequency  domain  to  electrostat¬ 
ics,  magnetostatics,  and  interior  dynamic  problems  [4],  but  more  general  application  is  becoming 
possible  through  the  use  of  edge  elements  and  accurate  absorbing  boundary  conditions  [5].  Fi¬ 
nite-element  formulations  based  on  explicit  and  implicit  time-differencing  methods  are  also  un¬ 
der  development  [6,7].  Published  work  on  transient  finite-volume  techniques  has  been  based  on 
the  application  of  computational  fluid-dynamics  (CFD)  methods  to  Maxwell’s  equations  [8]  and 
on  a  direct  generalization  of  rectilinear  FDTD  to  arbitrary  convex  polyhedrons  [9-15].  Both 
techniques  use  explicit  time  differencing,  but  the  CFD-based  methods  define  the  vector  electric 
and  magnetic  fields  on  a  single  grid,  whereas  the  generalized- Yee  formulations  use  two  inter¬ 
leaved  grids  referred  to  as  primary  and  dual  grids.  It  is  noted  that  some  finite-element  formula¬ 
tions  using  Whitney  elements  applied  to  Maxwell’s  two  curl  equations  also  use  a  dual-type,  sec¬ 
ondary  grid  to  evaluate  the  electric  field  along  edges  [6]. 

Much  of  the  previous  work  involving  time-domain  volumetric  solvers  applied  to  open-region 
problems  used  mapped-meshing  techniques.  Primary  grids  of  this  type  consist  of  skewed  hexa- 
hedral  and/or  wedge  elements.  The  grids  are  generally  block  structured  but  can,  of  course,  be 
referenced  in  an  unstructured  manner.  An  example  of  mapped-meshing  applied  to  microstrip 
problems  consists  of  forming  a  surface  grid  on  a  planar  geometry,  and  then  extruding  the  surface, 
or  shell,  elements  into  solid  elements.  The  resulting  grid  is  considered  to  be  2-1/2  dimensional 
because  the  grid  can  not  accommodate  geometry  variations  along  the  extrusion  direction 
(although  different  materials  can  be  incorporated  along  this  direction).  The  technique  has  been 
successfully  applied  by  Gedney  and  Lansing  [15].  For  general  geometries,  mapped-meshing  re¬ 
quires  mapping,  or  projecting,  one  surface  onto  another.  Geometries  must  be  zoned  using  this 
approach,  generality  is  limited,  and  user  intervention  is  often  high.  Free-meshing,  on  the  other 
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hand,  provides  increased  geometrical  flexibility  and  generally  lower  user  intervention.  The  re¬ 
sulting  grids  typically  consist  of  tetrahedral  elements,  although  fully  free-meshed  hexahedral 
grids  are  possible  in  some  situations  [16,17].  Tetrahedral  grids  are  computationally  expensive 
because  cell  count  can  become  exceedingly  high  for  practical  problems.  Compared  to  cubical 
hexahedral  grids  (FDTD),  filling  the  same  volume  with  free-meshed  tetrahedra  that  possess  an 
average  edge  len^  close  to  that  of  the  hexahedral  cells  will  lead  to  approximately  eight  times 
the  niunber  of  cells.  Consequently,  it  is  desirable  to  minimize  the  number  of  tetrahedra  in  the 
problem  space.  The  present  paper  addresses  using  local  tetrahedron  grids  interfaced  with  FDTD 
grids  and  applied  to  microelectronics. 

The  software  package  VOLMAX  rVolumetric  M^well  solver)  is  based  on  the  finite-volume  hy¬ 
brid-grid  (FVHG)  technique.  This  technique  permits  non-orthogonal,  unstructured  grids  to  be 
embedded  in  traditional  rectangular-cell,  structured-grid,  FDTD  [13,14].  The  FVHG  method  en¬ 
ables  high  complexity  regions  to  be  modeled  and  gridded  using  commercial  solid-modeling 
software.  The  resulting  complex  unstructured  grid  is  interfaced  to  simple  FDTD  cells  without 
spatial  interpolation  across  the  grid  interface.  This  is  accomplished  by  forcing  the  unstructured 
grid  to  terminate  on  a  closed  rectangular  surface  [14].  The  FVHG  technique  is  particularly  use¬ 
ful  for  open-region  problems  and/or  geometries  -with  local  regions  of  high  complexity.  The 
method  has  been  shown  to  provide  high  accuracy  in  scattering  applications. 

Microelectronics  applications  are  well  suited  to  an  FVHG  formulation.  This  is  because  umform 
microstrips  are  often  used  at  the  input  and  output  ports  of  the  device  under  study.  To  obtain  ac¬ 
curate  ^'-parameter  data,  these  microstrips  usually  extend  a  long  distance  and  require  accurate 
absorbing  boundary  conditions  at  the  line  terminations.  This  portion  of  the  problem  can  be 
modeled  accurately  and  efficiently  using  FDTD.  However,  the  device  imder  study  may  be  of 
high  complexity  and  require  a  conforming  grid.  This  complex  region  can  be  designed  using 
solid-modeling  software  and  then  gridded  with  free-meshed,  tetrahedral  elements. 

The  finite-volume  portion  of  the  FVHG  solver  [13]  is  based  on  a  generalization/simplification  of 
the  modified  finite-volume  technique  introduced  in  [9].  A  (variably  dissipative)  time-averaging 
scheme  is  used  to  obtain  long-term  stability.  Achieving  stability  for  tens-of-thousands  of  time 
steps  on  arbitrary  grids  with  explicit,  time-domain,  finite  volume  and  finite  element  schemes 
without  using  dissipation  remains  an  open  research  issue  across  many  disciplines.  It  is  noted  that 
the  non-dissipative  discrete  surface  integration  (DSI)  method  [10]  has  been  foimd  to  delay,  but 
not  eliminate,  late-time  growth  on  complex  grids  [13].  The  technique  used  by  the  FVHG  method 
permits  either  no  dissipation,  or  a  moderate  amount  of  dissipation,  depending  on  the  gnd  com¬ 
plexity.  Unfortunately,  the  “right  amount”  of  dissipation  is  generally  not  known  a  priori,  but 
guidelines  are  provided  here  that  have  been  found  to  be  successful  for  complex  tetrahedron  grids. 

This  paper  provides  an  overview  of  the  finite-volume  algorithm,  the  time-averaging  scheme, 
boundary  condition  implementation,  grid  generation,  and  time-step  definition.  Three  simple  mi- 
crostrip  applications  with  well-known  FDTD  solutions  are  examined.  The  examples  demonstrate 
the  accuracy  that  can  be  obtained  by  launching  microstrip-guided  waves  on  cubical  hexahedron 
(FDTD)  grids  and  through  a  dense,  linear  tetrahedra  region.  More  complex  geometries  that  fully 
exploit  the  power  of  the  FVHG  approach  to  microelectronics  will  be  topics  of  future  papers. 
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2.  Analysis 

The  finite-volume  method  can  accommodate  arbitrary  convex  polyhedron-element  primary  grids. 
The  vector  magnetic  fields  are  defined  at  the  centroids  of  the  primary  cells,  whereas  the  vector 
electric  fields  are  defined  at  their  vertices.  Dual  cells  contain  the  vertices.  The  dual  cells  can  be 
extremely  complex  and  typically  possess  many  faces,  edges  and  nodes.  A  preprocessor  builds 
the  dual  grid  from  the  primary  grid.  The  primary  grid  is  equivalently  labeled  an  H-grid,  whereas 
the  dual  grid  is  equivalently  labeled  an  E-grid.  A  typical  H-grid  tetrahedral  element,  referenced 


as  Hf,  is  shown  in  Fig.  2.1  (superscript  t  denotes  total  field).  Dual  cells  (not  shown)  surroimd 
the  four  vertices  and  are  referred  to  as  =  1-4-  The  dual  edge,  s^,  connects  the  centroids  of 


primary-cell,  /,  with  a  neighboring  cell,  k.  The  primary  edge,  Sp,  connects  Ey  and  E^-  The 

volume  of  primary  cell,  /,  is  denoted  while  the  volume  of  dual  cell,/,  is  denoted  Vj  ■  The 

area  of  face,  /,  on  primary  cell,  z,  is  denoted  by  A^p  while  the  area  of  face,  p,  on  dual  cell,/,  is 

i,l 

denoted  A^-.  j  and  n^*  denote  outward  normals  for  each  face  of  the  primary  and  dual 
JiP  J>P 


cells,  respectively. 


Maxwell’s  equations  are  given  by 

EAEt+aeE‘  =  VxH* 

n|jH*+a„,H‘=-VxE* 

where,  8,  jO,,  and  G  ffj  de¬ 
note  electric  permittivity,  electric 
conductivity,  magnetic  permeability, 
and  magnetic  conductivity,  respec¬ 
tively.  By  integrating  over  each  cell 
on  the  primary  and  dual  grids,  the 
finite-volume  time-domain  (FVTD) 
form  of  Maxwell’s  equations  is 


fE*  dVj  ^  "Ip  ’'***  ‘^J,. 


ye 


ye 


J,p 


(2.1) 


(2.2) 
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All  materials  are  defined  relative  to  primary  cells.  For  magnetic  materials  this  is  the  natural 
definition.  However,  electric  materials  are  associated  with  the  spatially  staggered  dual  cells  and 
therefore  the  permittivity  and  electric  conductivity  require  averaging.  The  average  values  are 
denoted  by  GJ  and  G  ,  respectively.  These  are  defined  as  follows.  The  average  permittivity 

on  dual  face,p,  is  defined  by 


pf=n _ 

^  I.V^ 


h 


(2.3) 


where  the  sums  are  taken  over  the  primary  cells  associated  with  the  dual  face.  The  average 
permittivity  for  the  dual  cellj,  is  constructed  by  forming 


Eg/ 


where  the  sum  is  taken  over  all  dual  faces  on  dual  cell,  7,  and  Nj 
on  the  cell.  The  electric  conductivity  is  similarly  defined. 


(2.4) 


denotes  the  number  of  faces 


Volume-Integral  Approximation 

The  volume  integrals  are  easily  evaluated  by  assuming  the  vector  field  remains  constant 
throughout  the  cell  volume.  This  leads  to  the  following  equations 


..•E 


11 


vj 


(2.5) 


mi 


H‘  =  - 


I 

I 


11  ”5 


A' 


i,l 


(2.6) 


A  first-order  approximation  to  the  vector  fields  within  cells  can  be  constructed  by  expanding  the 
fields  in  a  first-order  Taylor  series  and  applying  the  relationship  JJJV'F  dV  =  n  dA, 
where  denotes  a  scalar  field  and  n  denotes  an  outward  normal.  This  relationship  enables  the 
field  derivatives  to  be  easily  evaluated.  The  results  in  this  paper  are  based  on  constant  fields 
throughout  the  cell  volume.  The  effect  of  higher-order  representations  will  be  examined  in  a  fu¬ 
ture  paper. 
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Area-Integral  Approximation 

Similar  to  Madsen  and  Ziolkowski  [9],  the  cross  product  in  the  area  integral  is  expanded  as  a 
vector  triple  product  at  each  vertex  and  an  average  value  for  the  integral  is  constructed  by  sum¬ 
ming  over  all  vertices  of  the  face,  dividing  by  the  number  of  vertices,  and  multiplying  by  the  face 
area  (which  is  not  unique  for  non-planar  faces).  Note  that  for  tetrahedral  elements  the  faces  are 
triangles,  which  leads  to  the  following  simplification  (cf.  Fig.  2.2): 


V  V 


V 


Vl  p2 


Vl  p2 


rr 


E^-s^ 


eh  _  .  eh  eh 

>1  fp2  P  V  Vl 


(2.7) 


The  summation  is  taken  over  all  vertices 
of  primary  face,  (/,  1)  (cf  Figs.  2.1,  2.2). 
The  symbol,  represents  an  average 
field  along  the  edge  (defined  below). 
An  expression  similar  to  (2.7)  applies  to 
quadrilateral  faces,  with  the  overall  face 
normal  often  becoming  an  average  di¬ 
rection.  For  faces  with  an  arbitrary 
number  of  vertices,  as  is  generally  the 
case  on  the  dual  grid,  the  faces  are  sub¬ 
divided  into  triangles  from  which  the 
overall  area  is  computed.  For  these 
faces,  the  face-normal  generally  be¬ 
comes  an  average  direction. 


Time-Advancement  Scheme 

The  application  of  explicit  time-domain  methods  to  complex  cell  shapes  often  leads  to  a  well 
known,  late-time,  weak  instability  in  the  simulation  [13,18].  This  occurs  in  both  finite-element 
and  finite-volume  formulations  across  many  disciplines.  The  true  nature  of  the  instability  is  not 
fully  understood  but  is  believed  to  be  due  to  a  shifting  of  the  eigenvalues  of  the  discrete  formula¬ 
tion  relative  to  the  continuous  equations  [18].  CFD-based  methods  rely  on  a  dissipative  term  in 
the  discrete  equations  to  suppress  problematic  high-frequency  modes  [8].  The  impact  of  this 
term  as  applied  to  Maxwell’s  equations  has  not  been  thoroughly  addressed  in  the  literature.  For 
generalized-Yee  formulations,  the  use  of  time-averaging  to  create  a  variably  dissipative  time- 
integration  scheme  has  been  found  to  be  effective  [13].  However,  dissipative  time-integration 
can  lead  to  “over-damping”  of  the  simulation.  This  is  particularly  true  when  applied  to  lossless, 


540 


closed,  cavities.  Problems  that  possess  natural  dissipation,  such  as  radiation,  can  exhibit  slightly 
enhanced  attenuation  in  the  higher  frequencies  due  to  the  artificial  dissipation,  particularly  as  the 
grid  resolution  becomes  less  than  approximately  20  cells  per  wavelength. 

Traditional  FDTD  uses  a  second-order,  centered  time  difference  for  the  time  derivatives.  Sec¬ 
ond-order  accuracy  requires  that  the  differences  are  offset  in  time  between  the  electric  and  mag¬ 
netic  fields  which  gives  rise  to  the  well-known  “leap-frog”  time-differencing  scheme.  Applied  to 
the  magnetic  field  (with  the  electric  field  similar),  the  usual  centered  time  difference  is  written  as 
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lr=(«+l)A/ 


H’ 


«+3/2 


-  H’ 


«+l/2 


At 


(2.8) 


where  A  t  denotes  the  time  step.  A  (first-order)  time  average  for 


n+\/2 


is  the  following 
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where  CX  >  0,  but  practical  values  are  dependent  on  A  / .  The  average  corresponds  to  discrete 
time  samples  of  the  magnetic  field.  The  magnitude  of  the  Fourier  Transform  of  the  sampling 
function 
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where  8  (•)  represents  the  Dirac  delta  function,  is  given  by 
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(2.11) 


Here,  CD  corresponds  to  radian  frequency.  For  values  of  (X  less  than  infinity,  the  time  average 
provides  increasing  attenuation  with  increasing  frequency  (with  pracfical  upper  frequencies  se¬ 
lected  considerably  below  the  Nyquist  sampling  requirement).  Consider  a  uniform  grid  in  free 
space.  Define  At  =  A/icN)  ,  where  c  denotes  the  speed  of  light  in  vacuum,  A  denotes  the 
spatial  step  throughout  the  grid,  and  N  represents  a  positive  real  constant  required  for  stability. 
(0  At  can  now  be  written  as  {2n/N){AlX)  ,  where  X  denotes  free-space  wavelength. 
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Writing  the  grid  resolution  as  A  =  ?i  /  M,  where  M  is  z  positive  constant,  leads  to  the  result, 
CO  A  /  =  2tc  /  (A^  •  M).  A  graph  of  Eq.  2,1 1  as  a  function  of  •  M  is  shown  in  Fig.  2.3. 


As  an  example  of  using  Fig.  2.3, 
consider  a  grid  resolution  of  20 
cells  per  wavelength  {M  =  20) 
and  N  =  2  (which  is  typical  for 
three-dimensional  FDTD  on  a 
uniform  grid);  thus,  N  •  M  - 
40.  Choosing  a  =  40  leads  to 
an  attenuation  value  of 
0.999400.  At  80  cells  per 
wavelength  with  the  same  N 
and  a ,  the  attenuation  factor  is 
0.999963.  Although  these  val¬ 
ues  may  seem  insignificant,  they 
become  cumulative  in  a  time- 
advancement  scheme.  Just  as 
the  standard  Yee  algorithm  is  dispersive,  leading  to  increased  phase  error  over  propagation  dis¬ 
tance  [19],  the  presented  time-averaging  scheme  gives  rise  to  jfrequency-dependent  attenuation. 
Consider  pulse  propagation  in  one  dimension  over  a  distance  of  100  cells  with  N  =  2;  thus,  200 
time  steps  are  required  to  travel  the  line.  Using  Ct  =  40,  the  frequency  content  of  the  signal  at 
the  end  of  the  line  is  attenuated  by  approximately  12%  (1  dB)  at  20  cells  per  wavelength  and 
0.8%  (0.06  dB)  at  80  cells  per  wavelength  compared  to  lossless  propagation.  These  results  were 
obtained  by  raising  the  appropriate  value  from  Fig.  2.3  to  the  required  number  of  time  steps. 

For  fixed  values  of  a  and  M,  increasing  //(corresponding  to  decreasing  A  /)  leads  to  decreasing 
dissipation.  Although  this  may  appear  desirable,  the  resulting  dissipation  may  be  insufficient  to 
damp  the  growth  of  problematic  high-frequency  modes,  and  consequently,  a  smaller  value  of  a 
may  be  required.  For  embedded,  free-meshed,  tetrahedron  grids,  numerical  experiments  have 
lead  to  the  required  relationship,  A/  •  a  <  80,  for  long-term  stability  over  tens-of-thousands  of 
time  steps.  Taking  N  in  the  range  60-80  has  generally  been  found  to  be  successful. 

The  advancement  equations  based  on  time-averaging  are  the  following: 


(2.12) 
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The  effect  of  the  time  average  on  the  usual  Courant  condition  for  a  uniform,  free-space,  three- 
dimensional  FDTD  grid  is  easily  established  by  performing  a  Fourier  (von  Neumann)  stability 
analysis  [20]  of  the  advancement  equations.  The  analysis  is  straightforward  to  implement  md 
only  the  final  result  is  presented  here.  In  terms  of  the  filter  parameter,  a ,  the  resulting  equation 
for  determining  the  relationship  between  the  time  and  spatial  steps  is  given  by 

(a  +1)^  +2p3  [2(a+2)2  Ti-a(a+l)]+  P^  [a? -2  (a +1)]+  2a  P  +1  =  0  (2.14) 


where  ||3|<1  for  stability,  and  cAt  = 


Sample  calculations  are  shown  in  Table  2.1. 


It  is  seen  that  the  Courant  condition  slightly  tightens  with 
decreasing  a.  Although  it  is  not  possible  to  derive  a 
similar  rigorous  relationship  for  arbitrary  unstructured 
grids,  a  similar  conclusion  has  been  observed  numerically. 

FDTD  Contribution  and  Edge-Projected  Field 

Knowledge  of  the  vector  fields  at  cell  vertices  enables  the 
Table  2.1  calculation  of  an  average  field  projected  along  coimecting 

EffectofTime  Average  on  Uniform-  edges.  Arithmetic  or  volumetric  averaging  can  be  used. 

Grid  Courant  Condition  However,  the  approach  does  not  guarantee  that  the  fields 

are  divergence  firee  on  a  single  cell  level,  and  the  algo¬ 
rithm  does  not  reduce  identically  to  an  FDTD  form  when  cells  become  orthogonal.  These  issues 
are  rectified  by  incorporating  an  FDTD  correction  to  the  edge-projected  vector  field.  With  refer¬ 
ence  to  Fig,  2.1,  Ampere’s  and  Faraday’s  laws  specialized  to  the  field  components  normal  to 
primary-face  (i,  /),  and  dual-face,  (j,  p),  are 
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Where  Hi  and  Ep  denote  the  (face)  normal  components  of  the  magnetic  and  electric  fields 
(traditional  FDTD).  The  fields  are  assumed  to  remain  constant  over  the  area  integrals.  The  time- 
averaging  scheme  discussed  in  the  previous  section  is  used  for  the  time  derivatives.  The  “face 
average”  permittivity  and  electric  conductivity  discussed  above  have  been  used  (cf  Eq.  (2.3)).  In 

addition,  a  face  average  for  the  permeability  (  M' )  and  magnetic  conductivity  {<5  has  been 
introduced.  This  is  because  the  evaluation  face  is  located  at  the  interface  of  two  cells.  The  av¬ 


erage  permeability  is  defined  by 


4 = (-4  w + 4 


The  magnetic  field  projected  along  dual-edge,  sj,  is  finally  constructed  by  forming 


Ht  h-H‘ 


h 


(2.17) 


This  expression  directly  incorporates  the  FDTD  term  into  the  average  vector  field.  Note  that 


when 


nh  -sf 

i,l  I 


=  1,  the  expression  returns  only  the  FDTD  result.  Arithmetic  averaging  of  the 


vector  fields  at  the  edge  endpoints  has  been  used.  The  average  electric  field,  projected  along 
primary-edge,  Sp,  is  similar.  Note  the  use  of  edge-projected  fields  in  Eqs.  (2.7, 2.15, 2.16). 


Boundary-Condition  Implementation  for  Open  Conductors 

When  a  primary  grid  is  generated  for  an  inhomogeneous  region,  nodes  lie  along  the  material  in¬ 
terfaces  (cf  Fig.  2.4).  Dual  cells  associated  with  these  nodes  extend  across  the  interface  which  is 
why  material  averaging  is  used.  In  the  case  of  a  conducting  sheet,  the  boundary  dual  cells  be¬ 
come  split,  either  completely  or  partially,  depending  on  the  location  of  the  primary-cell  node. 
Direct  computation  for  the  vector  electric  field  is  no  longer  possible  in  the  usual  sense  for  these 
cells.  Of  course,  the  tangential  component  of  the  vector  field  is  zero,  but  the  normal  component 
generally  is  not.  A  similar  situation  occurs  with  traditional  FDTD,  but  due  to  its  rectilinear  na¬ 
ture,  no  complication  occurs.  However,  a  complication  does  occur  with  non-orthogonal  cells 

whenever  the  computation  of  •  s^,  where  denotes  a  primary  edge  with  one  node  on  the 


544 


conductor,  doesn’t  simplify  to  the  FDTD  term,  Ep.  In  this  case,  the  following  alternative  ex¬ 
pression  for  E*  •  is  used  (noting  xE*,  =  0) 


E‘-s^  =  i  Et+(Ksp-Et  In 


sppp- 


r^p]'^j,p 


Fig.  2.4.  Dual  cells  along  a  conducting  interface. 


surface  normal  at  the  boundary 
node  and  eJ  denotes  the  known 
vector  field  associated  with  edge, 
Sp.  The  average  surface  normal  is 

constructed  by  forming  a  weighted 
average  of  neighboring  face  nor¬ 
mals  using  only  those  faces  resid¬ 
ing  on  the  conductor.  Note  that 
expression  (2.18)  incorporates  the 
known  zero  tangential  field  at  the 
boundary  node,  but  relies  upon  the 


vector  field  off  the  surfece  to  provide  the  estimate  for  the  (surfi.ee)  normal  component  As  be- 
fore  the  FDTD  contribution  is  used  to  correct  the  (face)  normal  component  (which  has  been 
fouiid  to  be  a  critical  contribution).  Improved  representations  remain  an  active  area  of  research. 


Grid  Generation  and  Selection  of  Time  Steps 

The  FVHG  algorithm  implemented  here  uses  the  CAD  package  I-DEAS  [21]  to  generate  s^d 
models,  fiee-meshed  linear  tetrahedron  grids,  and/or  mapped-mesh  hexahedraFwedge  gnds.  The 
method  to  terminate  the  tetrahedron  grid  and  connect  a  structured  FDTO  grid  can  be  found  m 
Ref  [14]  The  method  is  based  on  embedding  the  geometry  of  interest  in  a  series  of  rectangular 
boxes  that  are  partitioned  with  quadrilateral  sub-surfaces.  These  surface  partitions  force  the  t^- 
rahedron  grid  to  terminate  such  that  the  surface  nodes  will  spatially  align  with  the  nodes  of  the 
structured  FDTD  grid  that  will  be  connected.  The  creation  of  the  proper  surface  potions  is 
easily  automated  within  I-DEAS.  The  direct  interface  to  hexahedral  cells  is  realized  throu^gh  the 
use  of  seven-faced,  six-sided  elements  called  “wrapper  cells.”  These  cells  are  generated  through 
the  pre-processor  as  a  consequence  of  the  topology  of  a  hexahedral  element  being  mterf^ed  to 
the  bases  of  two  tetrahedra.  The  wrapper  completely  encloses  the  unstructured  region.  The  re¬ 
sulting  grid  (including  the  wrapper)  represents  the  unstructured  primary  grid.  The  correspondmg 
dual  grid  is  constructed  during  the  pre-processing  phase.  It  is  noted  that  because  the  use  of 
wrapper  cells  enables  the  connection  of  mapped-mesh  hexahedral  elements  to  firee-meshed  tetra¬ 
hedra,  these  two  gridding  methods  can  be  logically  combined  within  I-DEAS. 
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The  primary  and  dual  grids  are  scanned  to  determine  their  maximum,  minimum^  and  average 
edge  lengths.  The  time  step  to  be  used  in  the  unstructured  region  is  related  to  the  minimum  edge 
length,  but  it  can  generally  be  chosen  considerable  larger  than  what  the  absolute  smallest  edge 
length  would  dictate.  The  following  provides  guidelines  for  the  choice  of  the  unstructured-grid 
time  step  obtained  through  numerical  experiments. 

Define  the  ratio  of  the  maximum  to  minimum  edge  lengths  on  the  primary  and  dual  grids  to  be 
Rp  and  Rc^,  respectively,  and  let  =  [[^  lTiax([[/?^]],[[i?(^]])  ]],  with  [[•]]  denoting 

the  closest  integer  greater  than  the  argument,  and  ^  Define  Atn  =  Atg  / Nu, 

where  Atn  denotes  the  time  step  in  the  unstructured  grid  and  Atg  denotes  the  time  step  in  the 
structured  (FDTD)  grid.  The  unstructured-grid  time  step  becomes  an  integral  sub-division  of  the 
FDTD  time  step  which  enables  this  region  to  be  advanced  several  sub-time  steps  for  each  time 
step  in  the  FDTD  grid.  Second-order  time-interpolation  has  been  found  to  be  an  effective  means 
to  time-couple  the  two  regions  [13].  Assuming  a  uniform,  free-space  FDTD  grid  with  spatial 
step,  A,  Atg  =  A  /  (c  Ng).  For  this  type  of  grid,  Ng  is  typically  defined  to  be  equal  to  2  (but 

could  be  taken  as  small  as  V3).  In  a  free-space  unstructured  region,  At^  =  A  /  (c  Ng 
It  is  assumed  that  time  averaging  is  only  used  in  the  unstructured  region,  and  as  previously 
shown,  time  averaging  slightly  tightens  the  Courant  condition.  This  is  why  the  parameter  ^  can 
not  be  uniquely  defined.  As  the  filtering  effect  increases  (corresponding  to  decreasing  CX ),  ^ 
generally  needs  to  be  chosen  toward  the  upper  end  of  the  defined  range.  Having  defined  the  time 
step,  the  dissipation  effect  is  easily  estimated  from  Fig.  2.3  by  setting  N  =  N if  •  Ng. 

3.  Microstrip  Applications 

Three  well  known  rectangular  microstrip  geometries  have  been  selected  to  study  the  effect  of  lo¬ 
cal  embedded  tetrahedron  grids  on  accuracy.  Issues  involving  grid  reflection  at  the  hexahedron- 
tetrahedron  interfaces,  propagation  through  dense  tetrahedron  regions,  sub-time  stepping,  and  the 
effect  of  dissipation  in  the  unstructured  grid,  are  examined  through  these  examples. 

Uniform  Microstripline 

As  a  first  example,  the  simple  microstrip  shown  in  Fig.  3.1  was  investigated.  The  substrate  rela¬ 
tive  permittivity  was  2.2.  The  geometry  was  modeled  by  a  uniform  collection  of  hexahedral 
elements  with  an  edge  length  of  0.1985  mm,  along  with  an  embedded  tetrahedron  region  with  an 
average  edge  length  of  0.1985  mm.  The  location  and  form  of  the  tetrahedron  region  are  shown  in 
Figs.  3.1  and  3.2.  The  microstrip  was  locally  terminated  with  a  simple  first-order  Mur  absorbing 
boundary  condition  (ABC).  Both  the  hexahedron  and  tetrahedron  grids  shown  in  Fig.  3.2  were 
created  within  I-DEAS  and  jointly  define  the  unstructured  grid  (excluding  the  wrapper).  Upon 
adding  the  wrapper  during  the  preprocessing  phase,  the  unstructured  grid  was  embedded  in  a  uni¬ 
form  FDTD  mesh  that  measured  202  x  68  x  88  cells  in  the  x,  y,  and  z  directions,  respectively. 
The  FDTD  cell  size  was  also  0.1985  mm.  The  FDTD  mesh  was  terminated  with  second-order 
Mur  boundaries.  In  the  FDTD  mesh,  Ng  was  set  to  2,  and  a  =  QO.  In  the  unstructured  region. 
Nil  =  3  (3  sub-time  steps),  and  a  =  1 2  (based  on  the  previous  guideline,  Ng  •  Nu  •  a  <  80). 
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Fig.  3.1.  Uniform  microstrip  test  geometry.  (Dimensions  in  mm.) 


The  microstrip  was  driven  by  adding  a  gaussian  source  between  the  power  and  ground  planes  at 
the  location  shown  in  Fig.  3.1.  Fig.  3.3  shows  the  voltage  under  the  midpoint  of  the  power  trace 
at  the  input  and  output  observer  planes.  A  comparison  is  made  by  replacing  the  FVHG  tetrahe¬ 
dron  region  with  cubical  hexahedral  cells  so  that  the  finite-volume  algorithm  resorts  exclusively 
to  the  FDTD  term.  Although  this  comparison  mesh  is  now  exclusively  FDTD,  the  microstrip 
remains  unstructured  with  the  same  values  for  Nn  and  (X ;  this  permits  a  careful  examination  of 
guided- wave  propagation  through  tetrahedrons  and  across  hex-tet  interfaces.  As  shown  in  Fig. 

3.3,  the  results  at  the 
two  observers  are  nearly 
identical  for  the  two 
grid  types.  The  only 
noticeable  difference  is 
a  slight  delay  at  the  out¬ 
put  observer  when  the 
tetrahedrons  are  present 
(FVHG  results).  This  is 
most  likely  due  to  the 
increased  cell  density 
associated  with  the  tet¬ 
rahedrons. 

Fig.  3.2.  Portion  of  unstructured  grid  for  Fig.  3. 1  showing  tetrahedra  region, 
which  extends  from  one  hexahedral  cell-width  above  the  power 
trace  to  the  ground  plane. 
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Fig.  3.3.  Results  at  input  and  output  planes  based  on  all  faexahedral  (FDTD)  and  embedded  tetrahedral 
(FVHG)  cells. 


Fig.  3.4a.  Difference  of  Fig.  3.3  results  at  input 
observer. 


Fig.  3.4b.  Normalized  spectrum. 


The  difference  between  the  FDTD  and  FVHG  results  at  the  input  observer  is  shown  in  Fig.  3.4a. 
This  reveals  the  subtle  propagation  differences  between  the  grid  types.  The  (peak)  voltage  dif¬ 
ference  is  seen  to  be  nearly  three  orders  of  magnitude  below  the  peak  voltage  in  Fig.  3.3.  It  is 
believed  that  this  can  be  further  reduced  with  improved  boimdary  conditions  for  open  conductors 
(cf.  Section  2).  The  spectrum  of  the  difference  field,  normalized  to  the  incident-field  spectrum,  is 
shown  in  Fig.  3.4b.  This  result  can  be  considered  the  return  loss  (iSl  1)  of  the  tetrahedra  region. 
In  the  substrate,  the  FVHG  grid  resolution  is  approximately  20  “cells”  per  wavelength  at  50  GHz 
based  on  the  maximum  of  the  average  edge  lengths  of  the  primary  and  dual  grids. 
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Low-Pass  Filter 

The  second  example  is  the  well  known  low-pass  filter  originally  studied  by  Sheen,  et  al,  [22].  A 
portion  of  the  filter  modeled  in  the  current  paper  is  shown  in  Fig.  3.5a.  Linear  microstrip  lines 
connect  to  the  input  and  output  ports.  The  geometry  shown  in  Fig.  3.5a  was  modeled  with  tetra¬ 
hedral  elements  with  an  average  edge  length  of  0.264667  mm.  This  portion  of  the  overall  grid 
contained  approximately  50,000  tetrahedra  and  is  shown  in  Fig.  3.5b.  Uniform  hexahedral  ele¬ 
ments  with  an  edge  length  of  0.264667  mm  were  used  to  model  the  microstrips  that  connect  to 
the  input  and  output  ports  depicted  in  Fig.  3.5a;  these  element  were  created  using  mapped-mesh 
techniques  within  I-DEAS.  It  is  noted  that  some  modeling  flexibility  has  been  lost  by  forcing  the 
tetrahedra  to  interface  with  these  hexahedral  elements;  this  accounts  for  the  slight  shifting  of  the 
port  locations  and  the  reduced  trace  widths  relative  to  the  actual  geometry  shown  by  Sheen,  et 
al,  [22].  The  complete  unstructured  grid,  including  the  wrapper,  is  shown  in  Fig.  3.6.  This  grid 
was  embedded  in  a  large,  uniform  FDTD  grid  that  measured  180  x  66  x  148  cells.  Second-order 
Mur  boundaries  terminated  the  FDTD  grid.  The  FDTD  cell  size  was  0.264667  mm.  The  mi¬ 
crostrip  was  driven  and  terminated  similar  to  the  previous  example.  The  source  plane  was  lo¬ 
cated  16  cells  fi-om  the  left  side  of  the  unstructured  grid  shown  in  Fig.  3.6.  The  number  of  sub¬ 
time  steps  in  the  imstructured  grid  was  4,  and  the  stabilization  parameter  was  set  to  10. 


The  location  of  the  observer  on  the  input  microstrip  was  under  the  midpoint  of  the  power  trace 
and  placed  three  cells  fi-om  the  edge  of  the  left  side  of  the  unstructured  grid  shown  in  Fig.  3.6. 
The  observer  on  the  output  microstrip  was  located  16  cells  fiom  the  right  side  of  this  grid.  The 
operating  range  for  this  circuit  is  less  than  10  GHz.  The  time  response  at  the  input  observer  is 
shown  in  Fig.  3.7a.  The  corresponding  result  for  51 1  is  shown  in  Fig.  3.7b.  Results  at  the  output 
observer  are  shown  in  Figs.  3.8a,b.  The  simulation  ran  for  20,000  time  steps  in  the  unstructured 
grid  (5,000  in  the  FDTD  grid)  and  required  approximately  8  hours  using  8  processors  of  a  Sun 
Spaicserver  1000  (with  50  MHz  Sparc-51  chips,  which  are  not  vector  processors). 


Fig.  3.5a.  Low-pass  filter  model.  Substrate  relative 
permittivity  was  2.2.  (Dimensions  in  mm.) 


Fig.  3.5b.  A  section  of  the  tetrahedron  grid 
that  encloses  the  low-pass  filter. 
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Fig.  3.6.  Full  unstructured  grid  for  low-pass  filter  including  connecting  traces  at  input  and  output  ports. 
Tetrahedra  region  shown  in  Fig.  3.5b  is  enclosed  in  the  mid-section. 


for  low-pass  filter. 


Fig.  3.7a.  Time  response  at  output  observer 
for  low-pass  filter. 


Fig.  3.8b.  Insertion  loss. 


550 


Microstrip  Patch  Antenna 

The  final  example  is  the  well-known  patch  antenna  originally  studied  by  Sheen,  et  a/.,  [22].  A 
portion  of  the  antenna  modeled  here  is  shown  in  Fig.  3.9.  For  this  problem,  a  low-density  hexa¬ 
hedron  grid,  with  an  edge  length  equal  to  the  substrate  width  (0.794  mm),  was  used  with  a  high- 
density  tetrahedron  grid  of  the  geometry  shown  in  Fig.  3.9.  A  surface  grid  with  an  average  edge 
length  of  0.3  mm  was  placed  on  the  patch  antenna.  A  portion  of  the  unstructured  grid  showing 
the  grid  transition  is  shown  in  Fig.  3.10.  Tlie  driving  microstrip,  including  the  source,  observer, 
and  termination  planes,  was  in  the  low-density  hexahedron  region.  The  observer  was  located  un¬ 
der  the  midpoint  of  the  power  trace  and  positioned  2  cells  from  the  left  edge  of  the  grid  shown  in 
Fig.  3.10.  A  local  first-order  Mur  boundary  condition  was  applied  at  the  left  edge  of  this  grid, 
and  the  source  plane  was  located  15  cells  from  the  left  edge.  Upon  adding  the  wrapper,  the  un- 


Fig.  3.9.  Model  of  patch  antenna.  Substrate  relative  permittivity  is  2.2.  The  input  port  is  at  the  trace 
“stub”  on  the  left  side  of  the  figure.  (Dimensions  in  mm.) 


structured  grid  was  placed  in  a  uniform  FDTD  grid  that  measured  96  x  44  x  60.  The  FDTD  ^d 
used  a  uniform  cell  size  of  0.794  mm  and  was  terminated  with  second-order  Mur  boundaries. 
Note  that  the  hexahedral  grid  only  provides  about  13  cells  per  wavelength  in  the  substrate  at  20 
GHz. 

Usmg  the  high-density  surface  grid  gave  rise  to  a  maximum  to  minimum  edge  length  ratio  on  the 
primary  grid  of  11.4  {Rp),  while  on  the  dual  grid  this  ratio  was  17.25  {Rd)-  Twelve  sub-time 

steps  were  required  in  the  unstructured  grid  {Ny,  =  12),  =  2,  and  the  stabilization  parame¬ 

ter,  a ,  was  set  to  3  (based  on  the  guideline.  Ns  *  Ny  -  a  <  80).  The  unstructured  grid  con¬ 
tained  approximately  36,000  tetrahedra  and  520  hexahedra  (excluding  the  wrapper).  Results  at 
the  input  observer  are  shown  in  Figs.  3.11a,b.  The  simulation  ran  for  :>6,000  time  steps  in  the 
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High  density  tetrahedrons 


Low  density  hexahedrons 


Fig.  3.10.  Partial  grid  of  patch  antenna.  The  feeding  trace,  modeled  with  uniform  quadrilateral 
shell  elements,  resides  in  the  low-density  hexahedron  mesh  and  connects  to  the  input 
port  of  the  patch  antenna.  At  this  point,  the  grid  transitions  to  a  dense  triangular-element 
surface  mesh  surrounded  by  tetrahedral  elements. 


unstructured  grid  (3,000  in  the  structured  grid)  and  required  approximately  6  hours  using  6  proc¬ 
essors  of  the  Sun  Sparcserver  1000.  It  is  noted  that  the  patch  antenna  required  approximately  1 
hour  to  generate  the  solid  model,  tetrahedra  grid,  and  mapped-mesh  hexahedra  grid  for  the  input 
trace  extension  using  I-DEAS  (cf.  Fig.  3.10),  as  well  as  all  primary  and  dual  grid  information 
from  the  preprocessor. 


Fig.  3.11a.  Time  response  at  input  observer 
for  patch  antenna. 


Fig.  3.11b.  Return  loss. 
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4.  Conclusions 

This  paper  has  examined  the  effect  of  local,  ffee-meshed  tetrahedron  grids  in  uniform-grid  FDTD 
as  applied  to  microstrip  geometries.  By  comparing  with  well  known  FDTD  solutions,  the  FVHG 
approach  was  shown  to  provide  very  accurate  results  when  specialized  to  rectangular  geometries. 
Numerical  reflections  at  the  hexahedron-tetrahedron  interfaces  have  typically  been  found  to  be 
suppressed  by  at  least  40  dB.  Much  of  the  reflection  is  believed  to  be  due  to  current  boundary 
condition  approximations  in  the  finite-volume  formulation  as  applied  to  open  conductors.  Im¬ 
proved  approximations  remain  an  active  topic  of  research.  Abrupt  changes  in  grid  density  and 
different  dispersion  characteristics  between  hexahedrons  and  tetrahedrons  also  contribute.  How¬ 
ever,  the  dynamic  range  that  is  cunently  obtainable  is  sufficient  for  most  practical  microstrip 
applications.  Extensive  numerical  experimentation  has  revealed  that  a  good  indicator  for  the  grid 
resolution  of  a  tetrahedron  primary  grid  is  based  on  the  maximum  of  the  average  edge  lengths  of 
the  primary  and  dual  grids.  The  upper  firequency  resolution  is  then  obtained  by  equating  this 
distance  to  the  wavelength  divided  by  twenty.  Because  the  FVHG  approach  uses  commercial 
solid-modeling  and  local  free-meshed  grids,  the  efficient  modeling  of  high  complexity  devices  is 
possible.  Examples  will  be  discussed  in  future  papers. 

The  development  and  application  of  unstructured  finite  volume  and  finite  element  methods  to 
transient  computational  electromagnetics  is  only  beginning  to  evolve.  Important  research  areas 
in  FDTD  such  as  radiation  boundary  conditions  and  sub-grid  models  will  undoubtedly  be  revis¬ 
ited  in  this  context.  Additional  research  into  the  late-time  stability  of  explicit  difference  schemes 
on  complex  grids  is  also  needed.  This  paper  has  utilized  an  effective,  variably  dissipative  time¬ 
averaging  scheme  to  obtain  stability  in  the  unstructured  grid,  but  non-dissipative  time- 
differencing  is  ultimately  desired. 
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Abstract 


We  have  developed  a  new  nonlinear  hybrid  vector  finite  element  method  to  model  wave  propagation 
in  nonlinear  media.  This  method  is  based  on  the  vectorial  wave  equation  and  can  adequately  descnbe 
the  vectorial  properties  of  fields  in  nonlinear  structures.  With  this  approach  we  have  studied  such  effects 
as  optical  soliton  formations,  self-focusing  effects. 


1  Introduction 

The  properties  of  nonlinear  optical  waves  guided  by  dielectric  nonlinear  waveguides  axe  of  great  interest 
because  of  their  unique  features,  such  as  sohton  formation  and  subsequent  propagation  and  self^ocusi^ 
effects  [1,  2,  31.  These  effects  and  their  potential  use  in  all  optical  signal  processing  devices  have  stimdated 
the  theoretical  investigation  of  the  behavior  of  the  nonlinear  guided  waves  [3,  4].  Under  the  assumptm  of 
negUgible  transverse  effects,  analytical  solution  can  be  obtained  for  many  c^s  by  emploj^g  plane-wave 
approximation.  This  approximation  can  be  used  when  one  is  interested  only  m  the  evolutional  ^riation 
along  the  propagation  axis  z.  However,  the  introduction  of  transverse  effects,  which  ^e  unavoidable  m 
prartical  situations  because  of  the  finite  cross  section  of  the  optical  beam,  ^mphcates  the  problem  and  m 
many  cases  numerical  methods  are  needed  to  study  the  transverse  effects  of  optical  beams.  Consequently, 
the  growing  attention  has  been  paid  to  numerical  methods  for  studying  electromagnetic  wave  propagation 
in  nLinear  optical  media  [5,  6,  7,  8].  Among  the  available  methods,  three  techniques  are  most  widely  used 
and  provide  practical  information  for  various  nonlinear  phenomena.  They  are;  beam  propagation  method 
(BPM)  [9  10  11],  nonlinear  time  domain  finite  difference  (NI^FDTD)  method  [12,  13]  and  split-step  ^te 
element  method  (SSFEM)  [7].  In  our  effort  to  meet  an  increasing  demand  for  more  accurate  and  realistic 
nonlinear  optics  simulations,  we  have  developed  a  three-dimensional  nonlmear  hybrid  vector  fimte  element 
method  (NL-HVFEM)  to  study  different  phenomena  due  to  wave  propagation  m  nonlmear  Kerr-t^e 
Contrary  to  the  most  of  the  existing  one-dimensional,  scalar  models,  the  NL-HVFEM  approach  is  b^d 
upon  the  vector  Helmholtz  equation  and  thus  can  predict  '^®'^t°rial  pr2>er^s  ll  td  Z 

media  without  any  assumptions.  In  the  following  chapters,  we  describe  the  NL-HVFEM  approach  and  its 
application  to  the  modeling  of  nonlinear  wave  propagation  in  three-space  dimensions.  Numerical  results  for 
different  nonlinear  effects  are  given  and  comparisons  with  previously  reported  results  are  made. 


2  Basic  Equations 

We  consider  nonUnear  electromagnetic  wave  propagation  in  an  isotropic  medium,  which  is  uniform  in  r;  with 
an  arbitrary  shape  in  the  transverse  (xy)  plane,  having  intensity-dependent  relative  permittivity  Cr,  such 

that 

«.=<i  +  a/(|£:p)  W 

where  ei  =  nl  denotes  the  linear  relative  permittivity,  a  is  the  nonlinear  coefficient  and  /  ^  be  an  Mbitraiy 
the  intensity  |B|“  such  that  /(O)  =  0.  When  /(|B|^)  =  \Ei\  Eq.  (1)  corresponds  to  the  Kerr-type 
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nonlinearity.  Assuming  exp(ja;t)  dependence  of  EM  fields,  the  time-harmonic  form  of  Maxwell’s  equations 
is  given  by 


V  X  H  =  juJ€Q€rE 

V  X  E  =  -jojfioH  (2) 

The  assumption,  concerning  time  variation,  is  valid  as  long  as  the  pulse  duration  of  the  optical  beam  is 
longer  them  a  subnanosecond.  To  study  the  spadotemporal  behavior  of  short  pulses,  which  are  obtmnable 
with  modem  mode-lodced  laser  technology,  the  present  approcich  is  not  valid  and  a  nonlinear  time  domain 
method,  such  as  the  NL-FDTD  [12],  has  to  be  adopted.  Equations  (2)  can  be  reduced  to  a  single  vector 
equation  in  term  of  E.  The  resulting  Helmholtz  vector  wave  equation  is 

V  X  V  X  E  =  k^CrE  (3) 

where  ko  is  the  wavenumber  in  free-space.  Using  the  conventional  approximation  of  a  slowly  varying  envelope 
and  assuming  also  that  the  axis  of  the  propagation  is  z,  we  can  rewrite  E  as 

E  =  S{x,y,z)  exp  j (wf  -  0z)  (4) 

where  £  represents  slowly  varying  amplitude  of  the  original  field  E.  Furthermore,  we  define  the  transverse 
operator  Vr  as 


Vr  =  xdx  -t-  ydy 


and 


E^£r-\-z€^  V  =  Vr  -h  zd.,  (5) 

where  £r,£:  are  the  transverse  and  the  z  components  of  E,  respectively.  Finally,  based  on  Eqs.  (3)-(5),  the 
vector  Helmholtz  wave  equation  becomes 


{Vr  X  Vr  X  -  zV%£z  +  zV r  '  “  j^zVr  ‘  \ 

/  ~  =  0 

+d,  VrSz  -  dl£r  -3&^r£z+  0^£t  ) 


(6) 


Furthermore,  in  derivation  of  Eq.  (6)  we  have  considered  all  terms  and  no  approximations  were  made. 


3  NL-HVFEM  Formulation 

First  we  subdivide  the  entire  cross-sectional  domain  (xy  plane)  of  the  system  into  a  number  of  small  sub- 
domains,  using  finite  elements.  In  the  two-dimensional  case  one  way  of  achieving  this  is  by  Delaunay  tri  an¬ 
gulation.  Galerkin’s  weighted  residual  approach  has  been  used  to  construct  our  finite  element  formulation 
[14],  and  its  application  to  Eq.  (6)  leads  to  the  following  bilinear  form 

B{W,  E)=  J  (VrW,  •  djr  -  Wr  •  d^Vr£z  ~  2j0Wr  •  8 Jr  +  Wr  •  8%)  rffi 

(Vr  X  Wr)  •  [Vr  X  £r)  +  VrW^  ■  Vr£z  +  30^rWz  ’  Sr 

~j0Wr  •  VrSz  +  P'^Wr  '  Sr  ~  f^W  ■  trS 

where  17  is  the  cross  section  of  the  waveguide  and  W,E  are  the  testing  and  trial  fields,  respectively.  In  the 
current  formulation  the  hybrid  finite  element  approach,  namely  edge-elements  for  the  transverse  component 
and  nodal  FEM  for  z  component  of  electric  field,  is  applied  to  the  waveguide  cross  section  [15].  It  has  been 
shown  that  this  method  is  free  of  spurious  modes  and  results  in  a  reliable  numerical  solution.  By  setting 
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B{W,E)  =  0  for  every  W  in  the  testing  function  space,  one  comes  up  with  the  following  differential  matrix 
equation. 


and  the  matrices  are  defined  as 

[Mrr\  [Mrz]  [iV..]  1  l^rr]  [Frz]  0  1  f 

[M]=  [M.r]  0  0  ;  [F]=  [F..]  [F„]  0  ;  e=  e  (9) 

[  0  0  J  [  0  0  [Ntt]  L  £r  J 

where  e^,e,  are  column  vectors  of  coefficients  for  the  transverse  and  z  components,  respectively,  and 
denotes 


Moreover,  the  element  matrices  in  Eq.  (8)  are  given  by 

[Mrr]  :  -2jP  f  {Wr-Sr}dn 

Jn 

(M„)  : 

Jn 

[M,r]  :  f  {£r-'^rW.}dQ 

Ja 

[;^„]  :  j^{W,-er)dSl 

[Frr]  :  f  {(Vr  X  Wr)  '  (Vr  X  £r)  +  0^Wr  ’  Sr  “  •  tr£r}d£l 

Jq 

[Fr.]  :  -jp  f  {Wr-Vr£z}d^ 

[F,r]  :  jp  [  {VrW,-€r]dU 
Ju 

[F„]  :  f  {^rWz  •  VtSz  -  •  er£,}dil 

Ju 

Finally,  apphcation  of  the  finite  difference  method,  based  on  the  backward  difference  scheme,  to  Eq.  (8) 
yields 

([M]  -  [F]Az)e*+^  =  [M]e*  for  i  =  0, 1, 2...  (12) 

where  e*  corresponds  to  e  at  .  =  iAz,  eo  is  the  initial  state  and  Az  is  the  difference  step 
Furthermore,  we  would  like  to  emphasize  to  in  our  iterative  process  for  eadi  step  r  after 
vector  ei  we  have  to  update  not  only  relative  permittivity  Cr  from  Eq.  (1),  but  also  propagation  constant  p 
based  on  Eq.  (8).  Consequently  we  end  up  with  the  following  iterative  scheme 

(i)  Specify  p,eL,a,ko,e^  as  an  input  data. 

(ii)  Calculate  the  coefficient  matrices  [M],[F]  and  solver  Eq.  (12)  for  a  new  solution  e*+^ 

(iii)  Update  form  Eq.(l)  and  P  from  Eq.  (8). 

(iv)  Iterate  the  above  procedure  (ii)  and  (iit)  until  we  teach  the  desired  number  of  iterations  in  a  arection. 
FinaUy,  in  the  Iterative  procedure  the  fbUowing  stabiUty  a.n«tion  should  be  observed  to  ensure  numerical 
Stability  of  computed  results 

p^z  >  1.  (^^^ 

Step-by-step  calculation  of  Eq.  (12)  results  in  the  variation  of  the  field  profile  that  is  initially  fired  at  the 
entrance  (z  =  0)  of  the  waveguide. 
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4  Numerical  Results 


The  following  section  is  devoted  to  the  problem  of  the  behavior  of  light  beam  in  a  nonlinear  medium.  In 
the  first  example,  a  planar  nonlinear  waveguide  has  been  analyzed  and  numerical  results  are  shown.  In  the 
second  example,  a  so-called  self-focusing  effect  of  an  axially  symmetrical  optical  beam  has  been  studied. 

Nonlinear  Slab  Waveguide 

In  our  first  example  we  apply  NL-HVFEM  to  a  planar  nonlinear  waveguide  (NLWG)  and  compared  our  results 
with  previously  published  ones.  Shown  in  Fig.l  is  a  three-layered  slab-type  dielectric  optical  waveguide  that 
consists  of  a  linear  substrate,  a  linear  film  and  a  Kerr-type  cladding,  with  a  nonlinear  permittivity  given 
by  Eq.  (1).  Waveguide  parameters  are  chosen  from  Ref.(5),  i.e.,  linear  refractive  indices  of  the  substrate 
(x  <  -d),  film  (ill  <  d)  and  cladding  (i  >  d)  are  n,  =  1.55,  Uf  =  1.57  and  =  1.55,  respectively,  and 
nonlinear  coefficients  are  a,  =  a/  =  0;0,  Qc  =  1.5  x  10“^,  where  the  subscripts  s,  f,  c  refer  to  the  substrate, 
film  and  cladding,  respectively. 


°c  +  “clEl 


“f 

2a 


Figure  1:  The  geometry  of  the  non¬ 
linear  slab  waveguide. 


Input  power 

Figure  2:  Power  trapped  in  the  NLWG  Pt  versus  the  input 
power  Pin  for  the  beam  initially  shaped  as  TEq  linear  mode. 


The  numerical  experiment  consisted  of  launching  an  input  beam  of  a  fixed  profile  but  variable  power  into 
the  waveguide.  The  input  profile  was  chosen  to  be  the  linear  TEq  mode  of  the  linear  planar  waveguide.  The 
point  of  interest  is  the  way  in  which  the  nonlinear  system  sheds  the  excess  energy:  a  single  self-channeled 
wave,  or  soliton,  is  emitted  through  the  film-cladding  interface  into  the  nonlinear  medium  and  propagates 
away.  Figure  2  plots  the  power  trapped  in  the  waveguide  Pt  versus  the  input  power  Pin,  where  the  power  is 
defined  by 


P  —  j  E  y.  H*  'zdx  (14) 

and  X  is  the  transverse  coordinate  in  unit  of  w/c.  The  figure  clearly  shows  the  sequence  of  thresholds. 

For  the  low  input  power  the  beam  is  confined  in  the  linear  film  and  has  only  a  small  displacement  in  the 
nonlinear  region  and  no  soliton  is  emitted.  Only  with  the  input  power  is  greater  than  a  certain  threshold, 
one  readies  a  situation  when  a  single  soliton  is  emitted  in  the  nonlinear  cladding  (Fig.  3).  The  single  soliton 
emission  continues  with  the  power  of  the  soliton  staying  almost  constant  as  we  increase  the  input  power 
even  more  (this  can  be  deduced  from  the  almost  linear  slope  of  the  corresponding  portion  of  the  Pt  vs.  Pin 
curve).  Furthermore,  if  the  input  power  is  further  increased  we  should  expect  a  second  threshold  leading 
to  the  emission  of  the  second  soliton  and  so  on.  Indeed  the  trcinsmission  of  the  multi-solitons  through  a 
nonlinear  interface  was  observed  up  to  the  third  threshold  (Fig.3).  Moreover,  it  turns  out  that: 
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Figure  3:  Evolution  for  koa  =  2n  of  the  input  fields  initially  shaped  asTEo  mode  for  different  values  of  the 
input  power. 
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Nonlinear  optical  fiber 


In  our  second  example  the  NL-HVFEM  was  applied  to  study  self-focusing  effect  in  a  nonlinear  optical  fiber.  A 
number  of  theoretical  and  experimental  papers  have  been  devoted  to  self  focusing  of  wave  beams  in  nonlinear 
media  [1,  2,  3].  For  the  simulation  runs  we  fixed  the  initial  intensity  profile  to  be  a  Gaussian  distribution 
and  the  initial  phase  front  was  assumed  to  be  plane.  Consequently, 

£(r,0)  =  Eoexp(-^)  (15) 

where  a,  the  initial  waist,  has  been  taken  a  =  0.4m.  We  note  that  all  dimensions  and  parameters  of 
nonlinear  medium  were  chosen  simply  as  a  representative  case.  It  would  be  easy  to  tailor  the  problem  to 
some  specific  dimensions  and  materials.  Chosen  parameters  will  rather  simplify  our  simulations  and  will 
ensure  the  the  self-focusing  will  occurs  near  the  center  of  our  computational  grid.  In  our  study  we  have 
tried  several  grid  sizes  to  perform  numerical  simulations.  Most  of  them  were  run  for  approximately  6000 
triangular  elements  per  optical  fiber  cross  section  with  spatial  resolution  Sz  =  Ao/8.  Our  calculations  were 
limited  simply  by  the  amount  of  available  computer  memory.  We  would  like  to  study  here  conditions  under 
which  an  initial  Gaussian  beam  can  produce  its  own  dielectric  waveguide  and  propagate  without  spreading. 
Such  self-trapping  appears  to  be  possible  only  for  the  hight  intensity  of  an  input  beam  when  an  input  power 
is  greater  then  a  critical  one.  To  come  up  with  an  expression  for  a  critical  power  we  ran  our  simulations  for 
different  parameters  characterizing  nonlinear  medium  ,  such  as  and  a  as  well  as  for  different  frequency 
points.  Based  on  our  numerical  results,  the  critical  power  can  be  approximated  fairly  well  by 


where  Aq  is  the  w-avelength  in  vacuum.  To  obtained  the  corresponding  result  in  cgs  unit  we  divide  by  4tt€o 
and  result  in  the  formula  numerically  found  by  previous  researchers 

,  _  (1.22Ao)^c 


where  c  denotes  speed  of  light  in  vacuum.  Only  when  the  input  power  is  greater  them  the  light  beam 


Figure  4:  Radial  distribution  of  the  total  electric  field  for  nonlinear  medium  with  a  =  0.01  and  ei  =  2  for 
/  =  1.5GHz.  Curves  are  parameterized  by  z/Xo  for  a)  Pin/Per  =  1-243  and  b)  Pin/Per  =  1-720. 


transforms  into  the  self-trapped  regime.  If  we  define  the  self-focusing  length  as  the  distance  in  which  the 
intensity  of  an  optical  beam  reaches  the  maximum  then  from  Fig.4  we  conclude  that  the  growth  of  the  input 
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(b) 

Figure  5:  Numerical  simulations  for  nonlinear  medium  with  a  =  0.01  and  ex,  =  2  for  /  =  X.bGHz.  Curves 
are  parameterized  by  z/Xq  for  Pi„/Pcr  =  1.243  (left  column)  and  Pi„/Fcr  =  1.720  (right  column),  a)  Radial 
distribution  of  the  component  b)  Fringe  plot  corresponding  to  the  component. 


power  decreases  the  self-focusing  length.  The  characteristic  profiles  of  the  amplitude  P(r,z)  for  different  z 
are  also  shown  in  Fig.  4  for  a  nonlinear  medium  with  =  2  and  a  =  0.01. 

In  addition  to  the  self-trapping  behavior  of  an  optical  beam,  we  also  can  observe  a  new  effect  that 
may  occur  only  when  the  full  vector  nature  of  Maxwell’s  equations  is  taken  into  account.  As  an  input  beam 
propagates  in  the  z  direction,  the  self-focusing  effect  seems  to  be  limited  by  the  generation  of  the  longitudind 
field  component  The  growth  of  the  longitudinal  electric  field  component  of  an  initial  TE  optical  beam  is 
addressed  specifically  in  Fig.5  and  its  behavior  is  similar  to  that  of  the  transverse  component.  This  effect  of 
Ez  component  is  unavailable  from  scalar  modes  and  can  be  explained  only  as  a  complete  solution  of  vector 
Maxwell’s  equations  and  exhibits  the  nonlinear  energy  transfer  phenomena  associated  with  the  longitudinal 
electric  field  component.  Fig.5  presents  sequences  of  plots  that  illustrate  this  effect  in  the  self-focusing 
process. 


5  Conclusion 

The  NL-HVFEM  approach  described  in  this  paper  solves  directly  the  vector  Helmholtz  wave  equation  and 
is  applicable  to  optical  structures  of  arbitrary  geometry  and  reflection  index  distribution.  The  vectorial 
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properties  of  the  fields  are  fally  considered.  Using  this  method  we  have  studied  such  effects  as  soUton 
formation  cind  subsequent  propagation  in  a  nonlinear  slab  waveguide  and  self-focusing  effects  in  a  nonlinear 
optical  fiber.  For  these  cases  the  conventional  analytical  approach  is  of  little  use.  However,  fiill-wave 
numerical  solution  techniques,  such  as  the  NL-HVFEM  developed  herein,  can  be  used  to  study  general 
nonlinear  optical  systems. 
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Abstract 

In  this  paper,  we  apply  a  version  of  the  Fast  Multipole  Method  (FMM)  to  reduce  the  storage 
and  computational  requirement  of  the  boundary  integral  in  the  finite  element-boundary  integral 
method.  By  virtue  of  its  operation  count,  the  application  of  the  single-stage  FMM, 

results  in  substantial  speed-up  of  the  boundary  integral  portion  of  the  code,  independent  of  the 
shape  of  the  BI  contour.  We  will  discuss  the  efficiency  of  the  method  and  present  the  application 
of  this  technique  to  the  computation  of  electromagnetic  scattering  from  large  grooves  recessed 
in  a  ground  plane. 


1  Introduction 

Over  the  past  few  years  different  hybrid  versions  of  the  finite  element  method  have  been  explored 
for  application  to  scattering  by  composite  structures.  Among  them,  the  finite  element-boundary 
integral  equation  (FE-BI)  and  the  finite  element-absorber  boundary  condition  (FE-ABC)  methods 
have  been  quite  popular  and  extensively  applied  to  many  applications.  The  FE-BI  method  [1],[2] 
employs  the  exact  boundary  integral  equation  which  provides  an  independent  relation  between  the 
tangential  E  and  H  fields  on  the  mesh  outer  boundary  and  is  therefore  an  exact  method.  This  is  in 
contrast  to  the  FE-ABC  method  [3]  which  employs  an  approximate  truncation  operator  but  leads 
to  fully  sparse  systems.  On  the  other  hand,  the  FE-BI  method,  although  “exact”,  leads  to  a  partly 
full  and  partly  sparse  system  {see  figure  1)  and  is  thus  more  computationally  intensive.  For  special 
cases,  where  the  boundary  is  rectangular  or  circular,  the  FFT  can  be  used  [4]  to  reduce  the  memory 
and  CPU  requirements  to  iVlog  N.  However,  in  general,  the  boundary  integral  is  not  convolutional 
and  in  that  case  the  CPU  requirements  wiU  be  of  0{N^)  where  Nb  denotes  the  unknowns  on  the 
boundary. 

In  this  paper,  we  apply  a  version  of  the  Fast  Multipole  Method  (FMM)  to  reduce  the  storage 
and  computational  requirement  of  the  boundary  integral  when  the  size  of  the  contour  is  large.  By 
virtue  of  its  0(iV^-®)  operation  count,  the  application  of  the  FMM,  results  in  substantial  speed-up  of 
the  boundary  integral  portion  of  the  code,  independent  of  the  shape  of  the  BI  contour.  We  consider 
the  appBcation  of  this  technique  (referred  to  hereon  as  the  FEM-FMM  method)  to  scattering  from 
a  material-filled  groove  in  a  conducting  plane.  The  FEM  is  employed  to  formulate  the  fields  within 
the  cavity  and  establish  a  relationship  with  those  at  the  aperture.  The  fields  external  to  the  groove 
are  expressed  as  an  integral  over  the  aperture  and  a  system  of  integral  equations  is  then  obtained 
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by  enforcing  field  continuity  across  the  aperture.  To  reduce  the  storage  requirement  and  speed  up 
the  computation  of  the  boundary  integral,  an  approximate  version  of  the  fast  multipole  method  [5] 
is  employed. 

2  Formulation 

Consider  a  filled  PEC  groove  of  width  w  and  depth  d  in  an  otherwise  uniform  ground  plane  as 
shown  in  figure  2(a).  The  material  filling  the  groove  has  a  permittivity  tr  and  permeability  fXr- 
The  free  space  region  exterior  to  the  groove  is  denoted  as  region  I  while  the  groove  itself  is  denoted 
as  region  II  (see  figure  2).  The  fields  in  the  two  regions  are  decoupled  by  closing  the  aperture 
with  a  perfect  conductor  and  introducing  an  equivalent  magnetic  current  based  on  the  equivalence 
principle 

Mi=Eixy  (1) 

where  Ei  is  the  electric  field  at  the  aperture.  The  ground  plane  can  be  removed  by  application  of 
image  theory  and  hence  the  field  in  region  I  can  be  expressed  as  the  radiation  caused  by  Mj  and 
the  external  sources  (J%M’).  The  coupling  of  the  fields  in  each  region  is  achieved  by  requiring 
continuity  of  the  tangential  magnetic  field  across  the  aperture 

HLIv=o  =  H,"  l,=o  (2) 

The  magnetic  field,  inside  the  groove  is  formulated  via  the  finite  element  method  which  has  the 
inherent  geometrical  and  material  adaptability  and  low  0(N)  storage  requirement.  This  procedure 
and  the  coupling  in  (2)  is  described  in  detail  in  [1].  is  expressed  as  an  integral  of  Mi  using 
the  free  space  Green’s  function.  To  this  integral  equation  we  apply  a  version  of  the  fast  multipole 
method  [5]  to  reduce  the  operation  count  from  O(N^)  to  <9(A’'^^’^)  where  Ni  is  the  number  of 
unknowns  on  the  boundary;  the  next  section  describes  this  procedure  for  TE  incidence. 


2.1  Boundary  integral  for  TE  case 

The  groove  is  illuminated  by  the  plane  wave 

jj!  _  _jgjfco(2:sin<^io4-J/ cos.io)  ^3^ 

where  ko  =  27r/Ao  is  the  free  space  wavenumber  and  (po  is  the  angle  of  incidence.  With  a  z-directed 
impressed  magnetic  field  the  scattered  magnetic  field  will  also  be  z-directed,  and  consequently 
the  equivalent  magnetic  current  Mi  may  be  written  as 

Ml  =  mi(x)  (4) 

From  figure  2(b),  the  magnetic  field  in  region  I  due  to  Mi  is  given  by 

H’(r)  =  lir(x,y)  +  H"/‘(x,y) 

-  jkoYo  f  M,(x')Go(p,p')dz’  (5) 

JCi 


GoiP,p')  =  -y  ^r(V(^-^')^  +  (t/-t/02 
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is  the  free  space  Green’s  function.  Applying  Galerkin’s  technique  to  (5)  yields  the  matrix  equation 

[cm  =  {4>^n-[pm  (7) 

where  {<p}  is  the  column  vector  with  Nb  +  1  elements  representing  the  nodal  magnetic  field  on  Ci, 
and  Nb  is  the  number  of  segments  employed  for  the  discretization  of  Cj.  {^}  is  the  column  vector 
with  Nb  elements  representing  the  magnetic  current  on  the  boundary.  The  column  vector 
has  Nb  elements  given  by 

+  m=  1,2,...,  Nb  (8) 

with  Axm  being  the  segment  length  of  the  mth  segment  on  Ci  and  Xm  being  its  midpoint.  The 
product  [€]{<!>}  in  (7)  is  of  not  much  concern  since  [C]  is  a  sparse  matrix  assembled  from  the 
interaction  of  the  magnetic  field  and  current  on  the  boundary  segments.  The  product  [P]{V’}  is  of 
concern  because  of  the  storage  required  by  the  full  matrix  [P]  whose  elements  are  given  by 


Pmr,  =  {H^^\ko\x,n  -  Xn\)  AXrnAXn}  m^Tl  (9) 

To  employ  the  approximate  version  of  the  FMM  in  solving  (7),  the  unknowns  on  the  boundary  are 
divided  into  groups  with  Mb  unknowns  in  each  group  and  thus  the  number  of  groups  will  then  be 
^  .  For  large  source  to  observation  distances,  the  kernel  in  (9)  is  approximated  by  using  the  large 
argument  expansion  as  [5] 

(10) 

where  is  the  distance  between  the  center  of  the  test  group  I  and  the  center  of  the  source 
group  l'\  Xnv  is  the  distance  between  the  nth  source  element  and  its  group  center  and  xim  is  the 
distance’ between  the  mth  test  element  and  its  group  center.  The  decoupling  of  test-source  element 
interactions  in  the  kernel  as  in  (10)  enables  the  computation  of  the  matrix- vector  product  for 
far-field  groups  with  a  reduced  operation  count  as  detailed  in  the  following  sequence. 

1.  For  each  test  group,  the  aggregation  of  source  elements  in  a  single  source  group  involves  Mb 
operations,  corresponding  to  the  number  of  elements  in  the  source  group.  The  aggregation 
operation  corresponds  to 

hi-i  =  (11) 

i=i 


2.  Since  the  above  aggregation  operation  needs  to  be  done  for  all  source  groups  the  operation 
count  becomes  0{^^Mb)  ~  0{Nb),  where  ^  represents  the  total  number  of  groups.  Also 
this  operation,  being^ dependent  only  on  the  test  group  rather  than  the  t^t  element,  needs  to 
be  repeated  for  ^  test  groups  leading  to  a  total  operation  count  of  O(^)  for  aggregation. 

3.  The  next  step  would  be  a  translation  operation  corresponding  to 


C/'i  = 


2j 

TT  y^koXl'l 


bin 


(12) 


Since  this  needs  to  be  done  only  at  the  group  level,  it  involves  0{j^)  operations  for  all 
possible  test  and  source  group  combinations  and  is  the  least  computationally  intensive  step. 
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4. 


The  final  step  in  the  sequence  would  be  the  process  of  disaggregation  corresponding  to  the 
operation 


Nb/Mb 


(13) 


i'=l 


Conceptually,  this  process  is  the  converse  of  aggregation.  Since  this  operation  involves  only 
the  source  group  instead  of  the  source  element  it  needs  to  be  done  for  each  source  group 
thus  implying  an  operation  to  generate  a  single  row  of  the  matrix-vector  product.  To 

generate  Mb  vows  corresponding  to  a  test  group  the  operation  count  would  be  0{Nb).  With 
^  test  groups,  the  operation  count  would  be  of  O(^). 


5.  The  near  field  operation  count  being  of  O(NbMb)  and  the  far  field  being  O(j^)  gives  a  total 
operation  count  of 

Ar2 

Op.count CiNbMb  +  C2-rr  (14) 

Mb  ^ 

Typically,  we  can  set  the  elements  in  each  group,  Mb  =  \/Nb  and  as  a  result  the  total  operation 
count  is  0~  Nb'^. 


Once  the  currents  are  computed  in  this  manner,  the  far-fields  are  computed  in  th  usual  manner. 
Thus  the  maximum  computational  requirement  in  this  hybrid  algorithm  is  unlike  the  usual 

FE-BI  which  results  in  0{N^)  if  the  boundary  has  N  unknowns. 


3  Results 

A  computer  code  based  on  the  above  formulation  w'as  implemented  and  executed  on  a  HP  9000/750 
workstation  with  a  peak  flop  rate  of  23.7  MFLOPS.  The  geometry  considered  was  the  rectangular 
groove  showm  in  figure  1.  Figure  3  shows  the  bistatic  RCS  at  normal  incidence  for  a  groove  20A  wide 
and  0.25A  deep  with  a  material  filling  of  =  4  and  pr  =  1-  The  discretization  of  this  geometry 
results  in  2106  unknowns  with  a  boundary  integral  of  300  unknowns.  To  obtain  a  measure  of  the 
speed-up  realized  by  the  application  of  the  FEM-FMM  to  this  problem,  we  refer  to  the  CPU  time 
comparisons  also  shown  in  figure  3.  The  compiled  data  on  the  FE-BI  and  FEM-FMM  methods 
include  the  CPU  time,  storage  requirement  and  average  error.  Data  are  also  give  for  the  FE-BI 
implementation  using  the  special  CG-FFT  solver  but  it  should  be  noted  that  the  application  of  the 
CG-FFT  is  only  suited  for  planar  apertures.  Clearly,  the  compiled  run  data  show  that  the  FEM- 
FMM  is  about  three  times  faster  than  the  traditional  FE-BI  implementation  and  requires  only  one 
sixth  of  the  memory.  However,  the  FEM-FMM  has  some  deterioration  in  accuracy  (average  error  of 
2  dB  for  this  example  [6])  and  this  should  be  considered  depending  on  the  application  and  usage  of 
the  results.  Similar  data  shown  in  figure  4  for  plane  wave  incidence  at  an  off- normal  incidence  angle 
(45*^)  seem  to  verify  the  previous  conclusions.  The  groove  analyzed  in  figure  4  is  30A  wide  and  is 
0.15A  deep  with  a  material  filling  of  e,.  =  6  and  =  ^-  The  discretization  of  this  geometry  results 
in  1803  unknowns  with  a  boundary  integral  of  450  unknowns.  The  error  in  the  FEM-FMM  solution 
is  low'er  but  this  could  be  attributed  to  the  use  of  a  larger  near-group  window  (4.3A  instead  of  2.45A) 
which  results  in  an  increase  in  storage  and  execution  time  of  the  boundary  integral.  Clearly  the 
CG-FFT  solver  is  substantially  faster  and  arguably  uncontested  by  the  FEM-FMM  method.  This 
simply  gives  credence  to  modifications  of  the  boundary  integral  surface  so  that  the  CG-FFT  can 
be  applied  whenever  possible  even  though  the  boundary  surface  may  not  be  completely  planar. 
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Figure  1:  The  FE-BI  system  matrix  arising  from  the  scattering/radiation  problem  of  a  groove  ina 
ground  plane.  Unknowns  88-105  are  the  boundary  integral  unknowns. 
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M  ^  Region  1 


^  ^  Region  2  ^  \ 

(c) 


Figure  2:  (a)  Geometry  of  the  groove  recessed  in  a  ground  plane  (b)  Equivalent  problem  for  region 
1,  and  (c)  region  2 
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3:  Bistatic  scattering  from  a  rectangular  groove  20A  wide  and  0.35A  deep  at  normal  incidence 


Time  (for  BI  only) 

FEM-BI  (CG) 

FEM-FMM 

FEM-BI  (CGFFT) 

(Minutes, seconds) 

30  mins  36  secs 

9  mins  1  sec 

2  min  15  secs 

Memory  (for  BI  only) 

LU 

FEM-FMM 

(KB) 

1544 

436 

Average  error 

FEM-FMM 

FEM-BI  (CGFFT) 

(dB) 

1.553 

1.0267 

Figure  4:  Bistatic  scattering  from  a  rectangular  groove  30A  wide  and  O.loA  deep  at  45°  incidence 


570 


Finite  Element  Method  Analysis  of  the  Celestron-8  Telescope 


R.  R.  DeLyser 
University  of  Denver 
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Kirtland  AFB,  New  Mexico 


The°Cele^on-8  cassegianian  telescope  (Figure  1)  is  representative  of  typical  sensors  that  could  be  vulnerable  to  high 
power  microwave  (HPM)  energy.  It  was  therefore  analyzed  using  the  Hewlett  Packard  High  Frequency  Structure 

Simulator  (HFSS). 


Figure  1.  The  Celestron-8  telescope. 

HFSS  is  a  three  dimensional  finite  element  simulator.  In  the  simulator,  the  geometric  model  is  automatically  divided  mto 
a  large  number  of  tetrahedra,  the  finite  element  mesh.  Within  each  tetrahedron,  the  vector  field  ^mponents 
perpendicular  to  the  edges  and  tangential  to  the  edges  are  mathematically  modeled  using  polynomials.  The  finer  the 
mesh  the  more  accurate  the  solution.  Of  course,  the  finer  the  mesh,  the  greater  the  demands  on  compumUonal  resoui^ 
in  terms  of  memory,  disk  space  and  solution  time.  In  order  to  find  a  good  solution  to  the  problem  and  "umi^  computer 
resources  HFSS  uses  “an  iterative  process  in  which  the  mesh  is  automatically  refined  in  critical  regions.  The  simulator 
first  solves  the  problem  using  a  course  initial  mesh,  refines  the  mesh  in  areas  of  high  error  density,  generates  a  mw 
solution,  etc.,  until  selected  S-parameters  converge  to  within  a  desired  limit,  or  until  all  computer  resources  have  been 
exhausted. 

The  computer  that  we  use  to  run  HFSS  is  an  IBM  RS6000  model  570  PowerServer  with  128  MBytes  of  random  acce^ 
memory  (RAM)  and  two  1  GByte  hard  drives.  We  have  access  to  all  of  the  RAM,  about  200  MBytes  of  i^gmg  spaM,  and 
340  MBytes  of  disk  space  for  user  files.  During  the  solution  process,  a  convergence  menu  can  be  momtored 
memoiy  and  file  size  values.  Two  matrices  are  written  to  disk  (user  files);  the  A  matrix  is  the  full  matnx  and  Ae  U 
matrix  is  the  “upper  triangular  matrix”.  Ail  other  memory  size  values  indicated  on  the  convergence  menu  are  RAM 
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usage.  As  stated  above,  the  iterative  process  of  mesh  refinement  is  continued  until  convergence,  or  until  the  computer 
resources  are  exhausted.  If  the  disk  space,  paging  space  or  RAM  is  exhausted,  the  solution  did  not  converge. 

A  total  of  eight  projects  were  created  for  HFSS.  They  were: 

1.  cyltest.pjt  -  A  cylindrical  waveguide  port  excited  cylinder  which  is  the  same  size  as  the  outer  cylinder  of  the  Celestron 
Telescope  (see  Figure  2.)  The  radiation  boundary  was  positioned  150  mm  from  the  cylinder  boundary.  This  was  done  so 
that  the  radiation  boundary  would  be  1/4  X  (at  the  lowest  frequency  of  interest  -  0.5  GHz)  from  the  radiating  surface  as 
suggested  by  HP. 


Boundary 

Outer 

Cylinder 


Port 


Figure  2.  The  cylindrical  waveguide  port  excited  cylinder. 

2.  cylinder.pjt  -  A  coaxial  probe  excited  cylinder  which  is  the  same  as  shown  in  Figure  24  with  the  exception  that  the 
port  is  replaced  by  a  coaxial  probe  which  has  its  center  conductor  inserted  along  the  z  axis  into  the  cylinder  a  distance  of 
3  cm.  The  probe  is  1  mm  in  radius.  The  outer  conductor  of  the  coaxial  port  is  2.3  mm  in  radius. 

3.  telescop.pjt  -  A  lull  model  of  the  telescope  with  the  exception  that  the  large  mirror  is  flat.  We  could  not  get  the  2-D 
cross-section  to  close  using  the  true  radius  of  curvature.  Since  the  2D  cross-section  would  not  close,  the  3D  mirror  could 
not  be  created.  Thus,  a  flat  mirror  with  approximate  tilt  was  created.  The  center  of  the  mirror  is  at  z  =  72  mm  and  the 
outside  is  at  z  =  73.8  mm.  This  model  is  excited  by  a  cylindrical  waveguide  port  located  at  the  eye  piece. 

4.  teletesl.pjt  -  A  full  model  of  the  telescope  (Figure  3)  as  detailed  above  in  telescop.pjt  with  the  exception  that  the  eye 
piece  is  open  and  a  coaxial  probe  and  port  as  described  in  cylinder.pjt  is  located  at  z  =  167.5  mm. 

5.  telehf.pjt  -  A  model  of  the  telescope  excluding  the  volume  behind  the  large  mirror  and  with  the  radiation  boundary 
located  17  mm  from  the  large  cylinder  boundary.  This  model  was  specifically  created  at  a  frequency  of  4.5  GHz  in  an 
attempt  to  have  a  model  at  a  frequency  higher  than  the  cutoff  frequency  of  the  inner  cylinder  (4.4  GHz). 

6.  cyltest2.pjt  -  An  open  cylinder  excited  by  a  square  coaxial  probe  at  the  center  of  the  cylindrical  shell  with  the  probe 
oriented  along  the  x  axis.  The  square  cross-section  probe  was  used  to  alleviate  the  problem  of  a  high  mesh  density  at  the 
small  cylindrical  probe.  The  inner  conductor  (probe)  of  the  coaxial  feed  is  4  mm  wide,  the  outer  conductor  has  a  9.2  mm 
inside  width.  The  radiation  boundary  was  chosen  to  accommodate  a  0.75  GHz  - 1.5  GHz  frequenq^  range. 

7.  cyltest3.pjt  -  This  project  is  the  same  as  cyltest2.pjt  with  the  exception  that  the  radiation  boundary  is  closer  to 
accommodate  a  1.5  GHz  -  2  GHz  frequency  range. 

8.  telerad. pjt  -  This  model  was  like  that  for  telescop.pjt  except  that  the  radiation  boundary  condition  was  enforced  at  the 
aperture  of  the  telescope.  This  in  effect  eliminates  any  fields  due  to  currents  on  the  outside  of  the  telescope.  This  has 
some  effect  on  the  fields  inside  the  telescope  but  should  provide  minimal  error.  The  goal  was  to  get  a  model  that  would 
work  at  a  frequency  above  the  cutoff  frequency  of  the  small  inner  cylinder.  HFSS  failed  to  refine  the  mesh  on  the  second 
pass.  We  think  that  this  is  an  indication  that  we  ran  out  of  RAM  (>128  MBytes.) 
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Figures.  The  coaxial  probe  excited  telescope. 

Table  1  sho»..hede.aiU  of  U>evariom  project  and 

measured.  The  model  did  not  converge.  Also,  the  telehf.pjt  model  did  not  converge. 


Table  1.  Summary  of  HFSS  projects  and  resource  requirements. 


Name  of  Project 

Frequency 

A  matrix 

Disk  Snace  (K) 

U  matrix 

Disk  Space  (K) 

csolve 

KAMCK) 

Total 

Time 

cyltestpjt 

1  GHz 

2  GHz 

16,302 

25.717 

65,965 

134.732 

37,340 

67.996 

65  820 

3i:iv 

1:25:54 

1:22:42 

17:13 

41:48 

2:40:00 

3:03:00 

Figures  4  -  7  show  results  for  me"n  wm^^to  *e^rSiation  plot 

project,  we  looked  at  the  radiation  results  )  frenuencv  of  the  inner  cylinder,  the  radiation  increases 

^bSally.  Vis'toerSLgTnote^^^  "^SSlabLSs  aftl^VnTo^^^^ 

Se“Ts^^TwTdrfie.d  intenaily  magnimde  plots  of  Figures  5  and  6  also  sho«  signiflcan. 


fields  behind  the  large  mirror. 


573 


Figure  8.  Radiation  at  4.5  GHz. 

HFSS  Results  vs.  Physical  Testing 

The  most  successful  models  were  cyltest2.pjt  and  cyltest3.pjt.  Results  for  the  fields  and  radiation  plots  at  the  two  major 
resonances  are  shown  in  Figures  9  - 14.  As  a  reference,  the  TE^n  and  TEju  modes  for  a  closed  cylindrical  ca^p^  of 
dimensions  of  our  open  cavity  resonate  at  1.36  GHz  and  1.822  GHz.  The  electric  field  plots  at  1.35  GHz  and  1.8  GHz 
clearly  indicate  that  the  transverse  modes  are  TEji  and  TE31  respectively. 


Figure  9  Electric  field  - 1.35  GHz. 


Figure  10.  Magnetic  field  - 1.35  GHz,  <{>-45". 
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Figure  11.  Radiation  at  1.35  GHz. 


Figure  12.  Electric  field  - 1.80  GHz. 
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Figures  15  -  17  show  results  for  S„  from  HFSS  and  two  tests  of  the  telescope  with  a  coam^  prote  ^ 

cySL2.pjt  and  cyltestS.pjt.  These  measurements  were  done  once  having  a  conducUng  plate  at  aP^  /f  ^ 

telescopfandthe^same  measurements  were  done  having  no  cover  plate  at  the  aperture.  Figures  16  and 

resona^s  are  at  the  same  location  but  of  different  depths.  It  appears  that  our  simple  model  of  an  open  cylinder 
accurately  models  the  telescope  for  these  low  frequency  resonances. 

Finally,  the  quality  feaors  for  the  two  resonances  at  1.8  OHz  and  1.35  GHz  are  calculated  by  the  HFSS  post  processor. 
These  values  are  55.5  at  1.8  GHz  and  25.20  at  1.35  GHz. 

The  concept  of  determining  energy  at  a  particular  point  or  area  in  a  system  by  doing  the  inverse  (ante^a)  problem  is  an 

attractive  one  The  vast  amount  of  information  contained  in  the  HFSS  solution  can  be  used  to  determine  optimum 

incidence  angles  to  couple  the  energy  into  the  system,  field  quanUties  anywhere  iii  the 

factor  for  the  system,  etc.  HFSS  was  used  in  this  study  to  demon^te  the  concept;  perha^oAer  ^s 

TEMAC-3D  (Temporal  Electromagnetic  Analysis  Code,  a  fimte  difference  Ume  dommn  (FD  )  ^ 

Element  Radiation  Model,  a  MOM  code)  and  CARLOS-3D  (Code  for  Analysis  of  Radiators  on  Los^  Spaces)  c^ld  also 

be  used  to  solve  the  antenna  problem.  All  of  these  codes  are  available  to  DOD  authonzed  persomel  and  are  designed 

exception  of  CARLV3D)to  do  the  radiation  prob^ 

University  of  Denver  computer  to  explore  the  radiation  problem.  CARLOS-3D  wodd  also  be  ^ 

radiation%blem  because  of  its  ability  to  handle  dielectrics  as  well  as  conductors.  However,  modifications  to  CARLO 

3D  would  have  to  be  made  to  do  the  radiation  problem. 

Thiswr^^  funded  by  Phillips  Laboratoiy/WSTS  and  the  Air  Force  Office  of  Scientific  Research. 


Figure  15.  HFSS  results  for  the  probe  excited  open  cylinder  (10  MHz  resolution.) 
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Abstract 

Some  results  of  the  analysis  of  errors  for  the  Adaptive  Integral  Method  (AIM)  are  presented. 
The  errors  are  introduced  by  the  use  of  auxiliary  basis  functions  in  computing  far  field 
impedance  matrix  elements.  The  error  dependence  on  the  multipole  expansion  order,  the 
size  of  the  Cartesian  grid  spacing  and  the  geometry  of  the  problem,  i.e.  the  size  and  the 
shape  of  the  basis  functions’  supports  and  their  relative  location,  are  discussed. 


1.  Description  of  the  AIM  algorithm 

The  Adaptive  Integral  Method  (AIM)  [1]  belongs  to  the  class  of  “fast”  iterative  solvers 
of  Maxwell’s  equations  in  their  integral  form.  These  methods  significantly  improve  upon 
the  conventional  Method  of  Moments  (MoM)  order  O(iV^)  impedance-matrix  storage  and 
matrix-vector  multiplication  complexity,  where  N  is  the  number  of  unknowns.  The  im¬ 
provement  is  achieved  by  effectively  “compressing”  the  impedance  matrix  by  means  of 
either  purely  algebraic  or  physically  motivated  algorithms. 

In  the  AIM  algorithm,  the  impedance  matrix  is  decomposed  into  “near-”  and  “far- 
field”  components.  The  near-field  component  is,  by  construction,  a  sparse  matrix,  whose 
elements  are  computed  as  in  the  conventional  MoM,  using  the  Galerkin  method  with  ar¬ 
bitrary  localized  basis  functions,  representing  volume,  surface,  or  line  current  elements. 
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In  the  far-field  component  the  original  current  elements  are  approximated  by  appropri¬ 
ately  specified  equivalent  distributions  of  pointlike  sources  located  at  nodes  of  a  regular 
three-dimensional  Cartesian  grid.  Since  the  transformation  between  the  original  and  the 
equivalent  current  distributions  is  given  by  a  sparse  matrix,  and  the  Green’s  function  dis¬ 
cretized  on  the  grid  is  a  (three  level)  Toeplitz  matrix,  the  storage  is,  asymptotically,  0{Nc), 
where  is  the  required  total  number  of  Cartesian  grid  nodes,  proportional  to  in 

surface  problems,  and  to  N  in  volumetric  problems.  Similarly,  the  serial  matrix-vector 
multiplication  complexity  is  log  N)  and  0{N  log  N)  for  surface  and  volumetric 

problems,  respectively. 

One  of  teh  methods  of  determining  the  equivalent  Cartesian  grid  sources  is  to  re¬ 
quire  that  they  reproduce  the  multipole  moments  of  the  original  current  element,  up  to 
a  desired  multipole  order  M.  The  AIM  solver  based  on  this  approximation  criterion  has 
been  fully  implemented  in  both  serial  and  massively  parallel  versions,  and  successfully 
applied  to  large-scale  three-dimensional  scattering  problems  involving  perfectly  and  im¬ 
perfectly  conducting  bodies.  Here  we  present  some  implications  of  the  error  analysis  for 
the  multipole-based  approximation,  including  issues  such  as  the  choices  of  the  Cartesian 
grid  and  the  expansion  order. 

The  far-field  approximation  of  the  AIM  algorith  is  applicable  to  any  Galerkin  matrix 
elements  of  the  form 


=  j  dxi  da:2^«(xi)ff(xi -X2)V'^(X2)  =  (t^a»  5^^)  > 

(1.1) 

where 

1  jb  Ixl 

K  |X| 

^W-4dxl 

(1.2) 

is  the  homogeneous-space  Helmholtz  equation  Green’s  function  with  fc  =  27r/A  being  the 
wave  number  in  the  medium.  The  form  (1.1)  includes  all  the  matrix  elements  arising 
in  the  discretization  of  Maxwell’s  equations,  in  which  case  the  functions  and  axe 
components  of  vector-valued  basis  functions  representing  electric  and  magnetic  current 
elements,  as  well  as  derivatives  (divergences  and  curls)  of  these  basis  functions. 

In  the  far-field  matrix  elements  the  basis  functions  are  approximated  by  Unear  com¬ 
binations  of  Dirac  delta-functions, 

V’„(x)  ~  W  =  Ku  -  u)  ,  (1-3) 
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localized  at  some  set  of  grid  nodes  u,  depending  on  the  basis  function  In  the 
present  implementation  is  a  cubic  expansion  box  of  (M  +  1)^  nodes,  whose  center  is 
nearest  to  the  center  of  the  support  of 

According  to  the  multipole-expansion  criterion,  the  coefficients  A  are  determined  by 
the  set  of  (M  -f  1)^  equations  expressing  equality  of  the  multipole  moments 

Qa,  (m,  m3)  ==  J  ^a(x)  (1.4) 

and  the  corresponding  moments  function  for  all  orders  0  < 

mj,m2,m3  <  M.  This  set  of  condition  leads  to  a  Vandermonde- type  linear  system  of 
equations  for  A. 

With  Eq.(1.3),  we  define  the  approximate  far-field  matrix  elements  as 
5^/?)  =  y  dx2^o(Xi)^(Xi  -X2)^^(X2) 

=  12  ^au<?(u-v)A^^  . 

The  AIM  approximation  to  the  impedance  matrix  A  is  then  constructed  as 

^AIM  _  ^Near  _j_  ^Far 


(1.5) 

(1.6) 


where 

=  I  ~  ^^st(C'^,  C^)  <  and  fi  7) 

\  0  otherwise,  ^ 

and  dist(C^,  C^)  is  the  distance  between  the  expansion  boxes  defined  in  terms  of  the  ||  • 

norm  in  grid-spacing  units. 

The  accuracy  of  the  approximation  A  ~  is  controlled,  in  addition  to  the  ex¬ 
pansion  order  M,  by  the  near- field  range  and  the  near- field  tolerance  £^^31.:  if  the 

distance  between  the  expansion  boxes  exceeds  d^eari  near-field  component  of  the  ma¬ 
trix  is  set  to  zero;  for  smaller  distances,  A^^^"  is  defined  as  the  difference  of  the  MoM  and 
far-field  matrix  elements,  or  as  zero,  if  that  difference  is  small  (in  terms  of  the  required 
tolerance  £NeaT)- 

Eqs.  (1.6)  and  (1.7)  exhibit  the  structure  of  A'^^^  as  a  compressed  approximate  A: 
A^^^  is  a  sum  of  the  sparse  matrix  and  a  product  of  sparse  and  Toeplitz  matrices. 


A^^"  =  AgA^ 


(1.8) 


(cf.  Eq.(1.5)). 
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2.  Results  of  the  Error  Analysis  for  the  A  ~  Approximation 

We  present  below  the  main  results  of  the  analysis  of  errors  associated  with  the  approxima¬ 
tion  A  ~  Since  this  approximation  is,  typically,  used  in  the  framework  of  iterative 

solution  methods,  we  base  our  error  analysis  on  the  estimates  of  residuals  in  the  equation 
Ax  =  b.  If  y  is  an  approximate  solution  to  the  equation  x  =  b,  then  the  residual  of 
the  original  equation  satisfies  the  bound 

\\b  -  Ayh  <  P  -  Vh  +  P  -  llvP 

<  ||6  -  j/P  +  P  -  PP  .  (2-1) 


with  the  Frobenius  norm  defined  by 


|1A-A^^^11|=^K,-A^™p.  (2.2) 


Therefore,  in  order  to  control  the  accuracy  of  the  solution,  we  have  to  estimate  the  norm 
(spectral,  or  at  least  Frobenius)  of  the  difference  A  -  A^™.  As  the  measure  of  the  relative 
error  we  take  thus  the  ratio 

Aaim  = 


1|A-AAIM||2 

\m 


(2.3) 


According  to  Eq.(1.7),  equivalent  to 


A  - 

(  0  —  ^Near  ~  -  ^Near  (2.4a) 

=  5Aq^  =  <  A^^  —  A^^^  if  <  ^Near  l^a/9  I  ^  %ear  (2.4b) 

if4^>4ear 

(with  =  dist(Ca,  C^)),  the  norm  ljA-A^™|lp  receives  near-  andfar-field  contributions, 
resulting  from  Eqs.  (2.4b)  and  (2.4c),  respectively.  An  upper  bound  on  the  near-field 
contribution  is 

\\A-  IIf  (Ne„)  <  PIIf  (N,„)  ^  \\Ah  .  (2-6) 

where  the  norm  ||  -  |lF{,jeai)  defined  as  in  Eq.(2.2),  but  with  the  sum  restricted  to 

^q/3  <  4ear- 

^AIM(Near)  —  %ear  *  (2-®) 
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In  practice,  however,  we  find  that  the  near-field  error  is  typically  much  smaller,  because 
only  a  relatively  small  fraction  of  matrix  elements  satisfies  the  conditions  of  Eq.(2.4b). 
We  concentrate,  therefore,  on  the  far-field  contribution  to  the  error;  with  a  fixed  near-field 
range  this  contribution  always  becomes  dominant  for  sufficiently  large  problem  sizes. 

At  large  distances,  the  accuracy  of  the  approximation  (1.5)  is  controlled  by  the  oscilla¬ 
tory  behavior  of  the  Green’s  function  (1.2).  Eq.(1.5)  can  be  interpreted  as  an  (M-(-l)^-point 
quadrature  of  the  integral  of  Eq.(l.l),  with  quadrature  points  u,  v.  By  construction  of 
the  coefficients  A,  the  weights  are  such  that  the  quadrature  is  exact  if  is  a  polynomial 
of  order  up  to  M  in  any  of  the  components  of  and  Xj.  This  interpretation  suggests 
that  the  expansion  order  M  should  be  sufiiciently  high  for  the  oscillatory  Green’s  function, 
~  exp(ik-x),  to  be  well  approximated  by  an  M-th  order  polynomial  within  the  support 
of  the  basis  function  and  within  the  expansion  box.  Thus,  if  the  support  size  is  6,  and 
the  grid  spacing  is  a.  the  relative  error  is  expected  to  be  of  the  order  of  the  Taylor  series 
remainder, 


(max{6,  Ma}  A:)^+^ 


(2.7a) 


(M-fl)! 

A  more  precise  estimate  of  the  far-field  error  is  obtained  from  the  large  distance  expansion 
of  the  Green’s  function;  if  R  =  —  €3  is  the  distance  between  the  centers  and  €3  of 

the  basis  functions’  supports,  we  have 


=  9{R) 


l  +  0(^ 


(2.8) 


where  6  =  (6j  -f-  b2)l2,  and  the  Fourier  transforms  ^  of  the  basis  functions  are  defined 
relative  to  the  centers  Cj  and  C3.  The  asymptotic  error  is  thus  expressible  in  terms  of  the 
error  in  the  Fourier  transforms, 


6^ik)  =  ^(k)  -  ^(k)  =  I  dx  ^{c  -K  x)  -  ^  A„  e*  . 

u€C 


(2.9) 


|x|<(./2 


In  particular,  the  relative  error  becomes  independent  of  the  distance, 


(2.10) 
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k  =  fcR),  and  provides  a  simple  estimate  for  the  far-field  contribution  to  the  error  (2.3), 

AAIM(Far)  ^  2  mM  Ai?„(k)  .  (2.11) 

a,k 

By  construction,  the  basis  functions  ?/>(x)  and  ^(x)  have  identical  power-series  expan¬ 
sions  in  components  of  k  up  to  the  order  M.  Therefore,  the  error  (2.9)  is  of  order  k  , 
with  the  coefficient  proportional  to  the  difference  between  the  multipole  moments,  Q  -  Q, 
of  order  (M  -h  1).  Although  the  moments  Q  depend  in  a  complicated  way  on  the  original 
moments  Q  of  orders  up  to  M,  and  on  the  grid  parameters,  we  have  obtained  a  rigorous 
bound 

-  (M  +  l)l 

for  M  >  1,  0  <  a  <  2fe/max{5,M}  .  (2.12) 

The  restriction  on  the  grid  spacing  a  is  crucial  here;  As  long  as  a  is  small  compared  to  the 
size  6,  the  moments  Q  of  order  higher  than  M  remain  comparable  to  the  corresponding 
moments  Q  (in  particular,  they  vanish  when  a  0).  For  larger  grid  spacings,  however, 
the  moments  Q  typically  grow  uncontrollably  large,  especially  for  higher  expansion  orders 
M.  For  large  a  and  M  the  error  may  become  as  large  as 

|6^(k)l^yX(f)“^‘/dxWx)|.  (2.13) 

Physically,  the  constraint  aM  <  2b  implies  that  the  size  of  the  expansion  box  should  not 
significantly  exceed  the  size  of  the  basis  function  support.  On  the  other  hand,  memory 
requirements  and  complexity  of  the  algorithm  grow  rapidly  with  the  decreasing  grid  spac¬ 
ing  (proportionally  to  (A/a)^);  therefore,  the  optimal  choice  is  the  largest  grid  spacing 
providing  an  acceptably  small  error. 

We  carried  out  an  extensive  numerical  analysis  of  the  errors  in  matrix  elements  for 
the  Maxwell’s  equation  case,  using  the  conventional  vectorial  Rao-Wilton-Glisson  basis 
functions.  As  an  example,  we  show  in  Fig.  1  the  magnetic-field  operator  error  (which  is 
larger  than  the  electric-field  one)  (Eq.(2.10))  as  a  function  of  the  distance  R  = 
between  the  current  elements.  We  used  typical  values  of  parameters:  a  —  A/8,  b  =  2<2,  and 
M  =  2,3,4. 
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The  curves  in  Fig.  1  exhibit  characteristic  periodic  oscillations  due  to  the  error  depen¬ 
dence  on  the  location  of  the  basis  functions  relative  to  the  grid  nodes.  They  clearly  show 
the  convergence  of  the  expansion,  and  the  fact  that  the  asymptotic,  J?-independent  accu¬ 
racy  level  sets  in  already  at  relatively  small  distances,  well  within  the  wavelength  A  =  8a. 
Since  the  AIM  multipole  expansion  can  be  interpreted  as  a  specific  quadrature  applied 
to  matrix- element  computation,  we  also  compared  the  matrix  element  errors  in  the  AIM 
algorithm  with  the  typical  quadrature  errors  of  the  “exact”  MoM.  Fig.  1  shows  (marked  as 
“MoM”)  the  difference  in  the  MoM  matrix  elements  calculated  using  12-point  and  3-point 
two-dimensional  Gaussian  quadratures  on  triangles;  the  latter  is  the  customary  choice  for 
the  non-self  terms,  especially  for  large  problems,  where  the  matrix  fill  is  one  of  the  most 
computationally  intensive  parts  of  the  algorithm.  The  comparison  indicates  that  the  typ¬ 
ical  Gaussian  quadrature  errors  in  the  MoM  matrix  elements  are  of  order  0.1%;  it  would 
be  thus  pointless  to  attempt  to  reduce  the  multipole  expansion  errors  below  the  ~  10“^ 
level. 

Since,  ultimately,  we  expect  the  accuracy  of  the  solution  to  be  controlled  by  the 
relative  Frobenius  norm  (2.3),  we  also  analyzed  that  quantity  numerically  for  a  number  of 
representative  scattering  problems.  As  expected,  A^jm  for  larger  problems  is  dominated 
by  the  far-field  contributions,  and  becomes  independent  of  the  problem  size.  We  give 
here  some  results  for  the  perfectly  conducting  NASA  almond  of  lenghts  2.5  A  and  5  A, 
corresponding  to  ~  2,  500  and  N  ~  10, 000  unknowns.  The  surface  triangulation  in  both 
cases  corresponds  to  ~  A/8.  We  found,  in  particular,  that  the  accuracy  <  10"^ 

results  in  the  back- scattering  cross-section  deviating  from  its  MoM  value  by  less  than 
about  0.1  dB  for  all  polarizations  and  in  the  full  angular  range.  Such  an  accuracy  can  be 
attained  with  ~  10~^,  <  2,  and,  e.g.,  with  M  =  2  and  a  ~  6/2  or  M  =  3 

and  a  ~  6;  computationally,  the  latter  choice  (a  higher  expansion  order  and  a  larger  grid 
spacing)  is  much  more  economical  than  the  first  one.  We  also  found  that  for  larger  grid 
spacings  (a  >  26)  the  accuracy  is  relatively  poor,  and  does  not  improve  markedly  with  the 
increasing  expansion  order. 
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Fig.l.  Relative  errors  in  the  matrix  elements  for  the  MaxweU  equation, 
for  expansion  orders  M  =  2, 3, 4,  together  with  the  quadrature  error  in  the  MoM 
matrix  elements  (marked  “MoM’).  Details  are  desribed  in  the  text. 

References 

1.  E.  Bleszynski,  M.  Bleszynski,  and  T.  Jaroszewicz,  “A  Fast  Integral  Equation  Solver 
for  Electromagnetic  Scattering  Problems”,  in  Proceedings  of  the  1994  IEEE  APS 
International  Symposium,  Seattle,  Washington,  pp.  417-420. 


58 


USING  MODEL-BASED  PARAMETER  ESTIMATION  TO 
ESTIMATE  THE  ACCURACY  OF  NUMERICAL  MODELS 


E.  K.  Miller 

3225  Calle  Celestial,  Santa  Fe,  NM  87501-9613 
505-820-7371,  eniiller@esa.lanl.gov 


ABSTRACT 

Determining  the  accuracy  of  the  results  produced  by  a  numerical  model  remains  an  ongoing  and  impor¬ 
tant  problem  in  computational  electromagnetics,  one  to  which  there  are  relatively  few  options.  Perhaps 
the  most  desirable  and  convincing  approach  is  to  perform  a  quantitative  comparison  betw'een  experimen- 
^  data  and  the  model  output.  Such  a  comparison,  though  undeniably  useful,  has  the  disadvantage  that 
it  embodies  two  different  kinds  of  errors,  these  being  the  physical  modeling  error  (PME)  and  the  numer¬ 
ical  modeling  error  (NME).  The  former  ^ses  from  any  differences  that  exist  betw^een  the  actual  physi¬ 
cal  problem  and  its  numerical  representation,  and  the  latter  occurs  because  the  numerical  results  actually 
obtained  represent  only  an  approximate  solution  to  that  numerical  representation.  Although  the  PME  can 
normally  be  expected  to  be  the  larger  of  the  two  and  also  the  more  difficult  to  evaluate,  quantitative 
knowledge  of  the  NME  itself  is  essential,  estimation  of  which  is  the  topic  of  this  paper. 

There  are  two  approaches  now  used  to  estimate  the  NME,  one  being  the  internal  checks  that  can  be  done 
within  a  speafic  model  or  code  (e.g.,  convergence  tests,  boundarv'  conditions,  power  conservation 
etc.),  and  the  other  the  external  checks  that  involve  comparing  the  results  of  two  or  more  different  mod¬ 
els.  The  approach  discussed  here  is  another  kind  of  internal  check,  but  different  from  the  others  in  the 
sense  that  it  seeks  a  “noise”  level  in  the  model  output  as  a  measure  of  the  model  accuracy. 

The  basic  idea  is  to  apply  a  low-order,  physically  motivated,  fitting  model  (FM)  to  a  sampled,  generat¬ 
ing  model  (GM,  bas^  on  a  first-principles  formulation)  to  be  tested.  Were  these  GM  samples  known 
exactly,  then  some  minimum  number  would  in  principle  permit  the  continuous  transfer  function  (TF)  to 
exactly  reconstructed  across  the  frequency  range  that  they  span  if  the  FM  exactly  represents  the  GM 
behavior.  In  fact,  however,  the  GM  samples  and  the  FM  are  only  approximations,  and  were  no  more 
samples  available  the  FM-based  TF  could  possess  no  better  accuracy  either.  However,  by  using  more 
samples  from  the  GM,  the  additional  information  they  provide  can  be  used  to  estimate  the  accuracy  of 
the  TF  derived  from  the  FM  and  of  the  GM  samples  themselves.  Furthermore,  these  extra  GM  samples 
might  also  be  used  to  improve  the  accuracy  of  the  FM  estimate. 


INTRODUCTION 

However  obtained,  determining  the  quantitative  accuracy  of  the  numerical  results  developed  for  a  partic¬ 
ular  electromagnetic  problem  is  essential  if  such  results  are  to  be  reliable.  This  is  a  problem  of  data  vali¬ 
dation.  Another  related,  but  broader,  problem  is  that  of  establishing  the  accuracy  that  may  be  expected 
to  be  obtained  from  a  given  computer  model  when  it  is  employed  following  accepted  guidelines  for  its 
use.  This  IS  a  problem  of  code  validation.  The  latter  problem  is  intrinsically  open-ended  and  one  in 
which  one  or  just  a  few'  code  developers  might  be  involved  while  the  former  is  more  specific  and  can  in¬ 
volve  relatively  many  more  users  of  that  code.  Attention  in  the  following  is  addressed  to  the  problem  of 
data  validation. 


The  basic  question  that  confronts  anyone  who  has  to  deal  with  numerical  results  is  how  accurate  they  are 
relative  to  the  requirements  of  the  problem  for  which  they  have  been  obtained.  This  issue  can  be  ex¬ 
pressed  in  various  ways,  but  ultimately  reduces  to  a  problem  of  associating  error  bars,  or  uncertainty 
bounds,  with  numerical  data  Traditionally,  the  purpose  of  error  bars  is  to  indicate  that  the  results  are 
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thought  to  have  some  numerical  probability  of  lying  within  the  band  they  span  at  some  level  of  confi¬ 
dence.  The  question  of  accuracy,  in  other  words,  has  a  probabilistic,  or  statistical,  context.  While  that 
degree  of  analytical  formality  and  rigor  could  be  useful  in  computational  electromagnetics  (CEM),  it 
goes  well  beyond  current  practice  and  is  not  necessary  as  a  first  step  towards  satisfying  some  minimally 
acceptable  accuracy  statement  for  computed  results.  In  the  discussion  here,  attention  is  limited  instead  to 
deriving  a  quantitative  estimate  of  numerical  solution,  or  data,  accuracy.  Systematic  extension  of  the 
idea  presented  here,  or  some  alternative,  could  lead  to  a  more  rigorous  methodology  for  error  assess¬ 
ment,  In  the  following  sections  we  discuss  in  turn  the  kinds  of  data  validaticm  that  can  be  used,  how  to 
model  EM  observables  or  data  using  reduced-order  models,  how  such  models  can  provide  an  estimate 
of  the  error  in  the  transfer  function  that  data  represente,  and  then  demonsitrate  application  of  this  proce¬ 
dure  to  a  pole  spectrum  and  the  current  induced  on  an  infinite,  circular  cylinder. 


KINDS  OF  DATA  VALIDATION 

There  are  two  fundamental  ways  by  which  data  validation,  or  estimating  the  accuracy  of  electromagnetic 
observables,  can  be  attempted.  One  is  the  compare  the  results  in  question  with  independent  data,  the  ex¬ 
pectation  being  that  the  true  values  are  probably  as  close  to  these  data  as  they  are  to  e^h  other,  some¬ 
thing  that  can  be  described  as  an  external  validation  or  check.  This  is  a  more  convincing  test  when  the 
independent  data  are  experimental  and  come  from  the  actual  problem  of  interest,  as  the  close  agreement 
of  two  numerical  results  doesn’t  necessarily  prove  that  they  are  correct  and  also  doesn’t  shed  any  light 
on  the  physical  modeling  error  (PME). 

The  other  is  to  perform  some  test  on  the  numerical  results  themselves,  either  to  evaluate  the  conformity 
of  the  results  relative  to  Maxwell’s  equations  or  their  numerical  consistency,  as  an  internal  validation  or 
check.  In  the  context  of  CEM  an  internal  check  that  derives  from  Maxwell’s  equations  is  most  satisfac¬ 
tory,  of  which  some  examples  are  reciprocity,  energy  conservation  and  bound^-field  checks.  An  ex¬ 
amine  of  perhaps  the  most  commonly  used  internal  numerical  check  is  a  convergence  test  whereby  the 
behavior  of  the  numerical  solution  is  examined  as  the  number  of  unknowns  is  progressively  increased. 
Unfortunately,  the  convergence  of  a  numerical  model  is  not  always  assured  to  occur,  nor  is  convergence 
to  a  correct  answer  guaranteed  either.  Furthermore,  a  convergence  test  can  be  very  expensive  and  even 
impractical  for  large  problems. 

A  different  kind  of  internal  check  using  Maxwell’s  equations  is  discussed  here.  It  is  based  on  the  obser¬ 
vation  that  the  various  field  quantities  derived  from  a  defining  microscopic  description,  i.e.,  Maxell’s 
equations,  must  subsequently  exhibit  a  known,  reduced-order,  macroscopic  behavior.  The  specific  be¬ 
havior  of  interest  for  our  purposes  is  the  resonance-like  structure  of  an  electromagnetic  frequency  re¬ 
sponse,  which  is  well-described  by  a  rational  function  [Miller  and  Burke  (1991)].  As  is  shown  in  the 
next  section,  knowledge  that  the  transfer  function  (TF)  has  a  simple  analjlical  description  can  be  exploit¬ 
ed  to  test  the  numerical  accuracy  of  the  macroscopic  field  variables,  and  hence,  of  the  model  that  pro¬ 
duced  them. 

The  approach  taken  here  to  estimate  the  accuracy  of  a  numerical  model  is  thus  based  on  modeling  the  so¬ 
lution  as  a  function  of  frequency.  An  alternative  model-based,  approach  might  also  be  employed,  where 
the  error  in  the  solution  is  the  quantity  being  modeled  rather  than  the  value  of  the  solution  itself  This 
approach  requires  knowledge  of  how  the  error  varies  with  the  number  of  unknowns,  X  or  equivalently 
with  the  spatial  sample  interval,  h,  since  this  permits  combining  solutions  using  different  values  of  X,  to 
obtain  a  solution  estimate  better  than  that  for  the  largest  X  used.  When  the  error  varies  as  Ah"  with  n  an 
integer,  a  behavior  true  of  many  numerical  operations,  a  procedure  known  as  Richardson  extrapolation 
results  [Ralston  (1966)].  Using  the  observation  that  an  EM  numerical  modeling  error  (NME)  for  an 

integral-equation  formulation  varies  as  Ae^'®^^  [Miller  et  al.  (1971),  Miller  (1996)],  a  simil^  ^proach 
can  be  used  to  improve  the  best  result  from  a  convergence  test.  The  approach  described  in  Miller  (1996) 
extends  Richardson  extrapolation  to  this  case. 
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MODELING  GENERATING-MODEL  SAMPLES 

It  has  been  demonstrated  [Miller  and  Burke  (1991)]  that  a  sequence  of  numerical  TF  samples,  Sj  =  S(fj), 
computed  from  a  CEM  model  [the  generating  model  (GM)]  as  a  function  of  frequency  f  at  sample  fre¬ 
quencies  fj  can  be  well-approximated  using  a  rational  function  [a  fitting  model  (F^j  as  a  generalization 
of  the  pole  series  that  alone  describes  the  resonance  behavior  of  the  former.  A  P-term  poles  series  can 
be  expressed  as  the  rational  function  r(P-l,P,0  =  n(P-l,0/d(P,0,  where  n(P-l,f)  =  nQ  +  njf  +  n2f^  +  . 

.  .  +  np_  jf^“^,  and  d(P,f)  =  dg  -i-  djf  +  d2f^  +  dpf^,  i.e.  as  a  ratio  of  tw’o  polynomials  of  de¬ 
grees  P  -  1  and  P,  respectively.  However,  as  P  increases,  this  model  becomes  less  robust  numerically. 
Furthermore,  an  electromagnetic  TF  is  not  completely  described  by  a  pole  series,  having  in  addition  a 
non-pole  contribution.  For  these  reasons  it  was  found  that  a  series  of  overlapping,  windowed,  low- 
order  rational  functions  having  the  form  r(N,D,f),  where  N  =  D  or  D  +  1  is  us^,  is  a  better  choice  to 
model  a  wideband  TF.  This  increase  in  the  order  of  n(N,0  relative  to  d(D,0  evidently  helps  to  model 
the  effects  of  poles  that  lie  outside  the  frequency  window  modeled  by  r(N,D,f)  as  well  as  the  unknown, 
non-pole  contribution. 

Each  windowed,  FM  is  quantified  by  fitting  ittoM  =  N  +  D+  1  computed  GM  samples  over  the  fre¬ 
quency  range  that  it  spans,  from  which  its  polynomial  coefficients  are  then  obtained  as  a  solution  to  an 
M’  th-order  data  matrix,  assuming  a  square  ^ta  matrix  is  used.  The  number  of  GM  samples  required  is 
one  less  than  the  number  of  coefficients  in  r(N,D,f)  because  one  of  the  coefficients  is  set  to  unity,  typi¬ 
cally  dp.  The  minimum  number  of  GM  samples  needed,  were  its  TF  to  be  a  purely  pole  series,  would 
be  two  per  pole  or  resonance;  otherwise  a  non-unique  FM  will  result.  However,  because  of  the  factors 
just  discussed,  at  least  3  samples  per  resonance  are  typically  required.  Thus,  if  a  FM  is  to  span  P  poles, 
M  ~  3P  GM  samples  will  be  needed  at  a  minimum.  If  the  FM  order  is  to  be  kept  below  a  ‘Yeasonable” 
number,  say  20,  then  the  number  of  resonances  spanned  by  a  FM  should  be  no  more  than  6,  with  nu¬ 
merator  and  denominator  potymomial  orders  of  at  most  9  or  10. 

Because  the  condition  number  (CN)  of  the  data  matrix  grows  with  increasing  M  for  a  FM  spanning  a 
fixed  frequency  band,  the  numerical  match  or  data  fit  (D^  achieved  betw'een  that  FM  and  the  GM  sam¬ 
ples  used  for  quantifying  it  will  decrease  commensurately.  On  the  other  hand,  the  model  fit  (MF)  ob¬ 
tained  betw  een  the  FM  and  additional  GM  samples  not  cdginally  used  for  its  quantification  will  increase 
with  increasing  M  until  some  maximum,  M’,  is  reached,  beyond  which  MF  will  also  decrease  because 
of  the  increasing  CN.  The  opportunity  for  estimating  the  accuracy,  or  error,  of  the  GM  samples  lies  in 
the  range  betw  een  some  minimum  value  of  M  and  M’  >  M.  Some  results  obtained  using  a  computer 
precision  (CP),  of  24  digits  for  a  15-pole  spectrum  having  Sj  =  vi/20  +  j*i  (j  =  V-1)  are  included  in  Fig. 

1  to  illustrate  these  points,  where  the  M  =  N  +  D  +  1  GM  samples  are  evenly  spaced  between  frequency 
endpoints  of  5.5  and  8.5. 

The  CN  and  DF  as  a  function  of  N  with  D  a  parameter  are  shown  in  Fig.  la  with  all  results  displayed  in 
digits,  i.e.,  where  a  CN  of  10^  has  the  numerical  value  of  X,  etc.  It  can  be  seen  that  increasing  N 
and/or  D  increases  CN  and  decreases  DF,  the  sum  of  which  is  furthermore  about  equal  to  CP.  Thus, 
w'ere  a  better  fit  to  be  sought  betw'een  the  original  data  and  the  FM,  this  might  be  accomplished  by  in¬ 
creasing  CP.  This  possibility  is  not  pursued  further  here  but  is  an  option  to  be  considered. 

The  corresponding  average  match  between  the  FM  and  additional  GM  samples,  MF,  is  shown  in  digits 
in  Fig.  lb.  It  can  been  seen  that  MF  increases  with  both  N  and  D,  until  the  growing  CN  causes  a  down¬ 
turn  for  the  larger  values  of  these  parameters.  By  varying  its  parameters,  it  is  possible  to  develop  a  FM 
from  the  original  GM  samples  that  produces  an  approximate  6-digit  match  to  additional  samples  witon 
the  frequency  range  spanned  by  the  FM.  The  significance  of  this  result  is  further  considered  below. 
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Figure  1 ,  The  condition  number  and  data  lit  between  the  fitting  model  and  the  pole-series  generating-nrodel  ^ples 
used  to  quantify  It  are  presented  in  part  (a),  while  in  (b)  Is  presented  the  agreement  between  the  fitting  model  and  ad¬ 
ditional  generaling-model  samples,  ail  results  being  shown  In  digits  and  as  a  function  of  the  fitting-model  nuFrorator- 
polynomial  order  with  the  denominator-polynomial  order,  D,  a  parameter.  The  maximum  model  fit  shown  in  (b)  indi¬ 
cates  that  the  smallest  error  detectable  for  this  particular  example  would  be  about  10'®.  These  calculations  were  per¬ 
formed  in  24-digit  computer  precision. 


ESTIMATING  THE  ERROR  IN  A  TRANSFER  FUNCTION 

The  accuracy  of  the  GM  samples  can  be  expressed  relative  to  an  exact  answer  for  the  numerical  repre¬ 
sentation  of  the  physical  problem  being  modeled,  Ej  =  E(fj),  as 

Ai=[l(Si-Ei)/EiG  (1) 

or  in  a  somewhat  more  convenient  measure,  digits  of  agreement  using 


Bi-logio[l(Si-Ei)/Eil].  (2) 

For  example,  a  1%  agreement  would  yield  Bj  =  2.0,  while  for  2  x  10’^  agreement,  Bj  would  be  2.7, 
etc.  The  a  priori  accur^y  of  the  numerical  samples  is  usually  unknown  as  is  Ej,  so  B  j  (and  Aj)  is  also 
not  available.  It  is  the  purpose  of  this  discussion  to  develop  a  procedure  for  estimating  Bj  without 
knowledge  of  Ej. 

First  observe  that  when  Sj  is  expressed  in  floating-point  form,  e.g.  using  an  x-digit  mantissa  where  x  = 
8  is  often-used,  these  samples  normally  contain  more  digits  than  can  be  justified  since  a  relative  accuracy 
of  10'^  is  usually  not  expected.  These  excess  digits  can  be  regarded  as  numerical  “noise”  that  repre¬ 
sents  an  accuracy  floor  for  a  given  sample.  Assuming  no  systematic  error  source  affects  the  irornputed 
samples,  the  numerical  noise  should  be  random  and  the  Sj  samples  should  be  randomly  distributed 
around  the  unknown  Ej  values.  The  possibility  of  estimating  Bj  is  based  on  this  assumed  random  varia¬ 
tion  of  Sj  relative  to  Ej  by  comparing  GM  and  FM  samples. 
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The  idea  is  to  oversample  across  the  frequency  range  of  interest,  Fq]^,  the  GM  whose  output  is  to  be 
tested,  using  a  subset  of  these  data  samples,  Sj)p,  to  develop  a  FM  that  spans  part  or  all  of  that  band¬ 
width,  Fpjyj.  This  FM  is  then  used  to  compute  an  average  DF  of  Bp)F  digits  to  the  Spjp  samples  and  an 
average  MF  of  to  the  remaining  samples  over  Fpj^.  The  value  of  B^^IF  relative  to 

vides  a  consistency  measure  between  the  two  data  sets  and  the  FM.  This  is  so  because  all  of  the  data 
comes  from  the  same  GM  and  so  Sj^p  and  must  be  internally  consistent  to  an  amount  determined 

by  the  average  GM  accuracy',  Bqivi.  The  value  of  Bp)p  for  the  data  set  S£)p  will  be  only  as  accurate  as 
permitted  by  the  CN  of  the  data  matrix  and  CP  used  as  illustrated  in  Fig.  la.  It  is,  however,  relatively 
independent  of  and  can  actually  exceed  it  since  Bp)p  provides  only  a  measure  of  well  the  FM  can 

match  a  set  of  numbers  whose  data  matrix  has  a  condition  number  CN  when  the  computations  are  per¬ 
formed  to  a  specified  computer  precision  CP.  However,  the  value  of  for  the  data  set  Sj,^  will  be 

limited  in  accuracy  to  the  lesser  of  Bqj^  and  Bpp  because  the  FM  can  match  the  data  set  no  better 
than  it  matches  the  data  set  S[)p  on  which  it  is  based,  the  accuracy  of  which  is  Bqj»^.  Two  outcomes 
are  thus  possible:  1)  Bpp  ~  B^;  and  2)  Bpp  >  B^.  The  third  possibility  ,  Bp,p  <  B^,  has  al¬ 
ready  been  excluded.  In  either  case,  B^^  provides  the  valid,  lower-bound  estimate  sought  for  a 

point  to  be  demonstrated  in  the  following. 

It’s  worthwhile  to  observe  that  the  procedure  just  outlined  is  similar  to  the  use  of  linear  regression.  For 
that  particular  problem,  the  goal  is  to  obtain  the  best  fit  between  available  data  and  a  strai^t  line  w'hich 
serves  as  a  model  for  the  data.  Uncertainty  or  errors  in  the  data  can  be  estimated  by  the  mismatch  that  is 
found  between  that  data  and  tiie  best-fit  line.  In  our  case,  the  model  is  a  rational  function  rather  than  a 
pole  series,  but  the  concept  is  the  same.  Both  of  these  problems  are  examples  of  the  more  general  one 
of  finding  the  model  that  best  represents  the  data  of  interest,  a  problem  also  known  as  system  identifica¬ 
tion.  A  general  discussion  of  modeling  data  is  given  in  Press  et  al.  (1992). 

For  illustrative  purposes,  consider  the  pole-spectrum  example  just  presented.  Based  on  the  results  of 
Fig.  lb,  it  appears  reasonable  to  select  a  candidate  FM  with  D  =  7  and  an  N  varying  from  6  to  9  in  order 
to  achieve  a  relative  maximum  in  B^y^,  using  the  requisite  data  set  S^p  needed  to  quantify  the  Fj^  pa¬ 
rameters.  Using  the  new  data  set  Sj^,  B^.^  can  then  be  computed  over  Fpj^.  In  order  to  illustrate  the 

effect  of  GM  uncertainty,  uniformly  distributed  random  noise  is  added  to  all  GM  samples  in  order  to  de¬ 
termine  the  degree  of  correlation  between  the  noise  level  and  B^^^p.  The  results  of  this  experiment  are 

plotted  in  Fig.  2. 


(DIGITS) 


Figure  2.  Shown  here  is  the  average  agreement  achieved  be¬ 
tween  a  fitting  model  and  additional  samples  of  the  generating 
model  for  the  example  of  Fig.  1 ,  as  a  function  of  the  generating- 
model  sample  accuracy.  The  fitting-mode!  denominator- 
polynomial  order  is  7  and  its  numerator-polynomial  order  is  a  pa¬ 
rameter.  This  result  indicates  the  feasibility  of  estimating  the 
accuracy  of  the  generating-model  samples  from  the  agreement 
they  exhibit  with  the  fitting  model. 
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The  quantity  controlled  by  the  dynamic  range  of  the  additive  noise,  is  plotted  on  the  horizontal 

axis  with  plotted  on  the  vertical  axis.  There  is  essentially  a  one-to-one  correlation  between 
and  Bj^/jp  up  to  values  between  5  and  6,  beycaid  which  variations  are  seen  that  can  be  deduced  from  Fig. 
lb  as  being  due  to  differences  in  the  FM  numerator-polynomial  order.  Note  that  this  computation  in¬ 
volves  no  a  priori  knowledge  of  Bq^^,  its  value  rather  being  inferred  from  B^p-  Thus,  when  using 
simulated  frequency-response  data,  it  appears  that  the  MBPE  procedure  does  ^ovide  a  qimtitatively 
valid  way  for  estimating  the  noise  level,  or  equivalently  the  accuracy,  of  the  data  being  modeled.  In  the 
next  section,  we  apply  Sie  technique  to  an  electromagnetic  problem. 


APPLICATION  TO  AN  ELECTROMAGNETIC  TRANSFER  FUNCTION 
The  problem  to  be  considered  is  the  frequency-dependent  current  excited  on  the  illuminated  side  of  an 
infinite  circular  cylinder  for  a  normally  incident,  TE  plane  wave.  This  problem  has  an  an^j^cal  solu¬ 
tion,  which  is  used  here,  but  has  also  become  a  benchmark  solution  for  vanous  C^  models.  Usi^ 
the  analytical  solution  permits  performing  a  computation  whose  accuracy  can  be  modified  m  a  contr^i^ 
fashion  by  adding  noise  to,  or  otherwise  affecting,  the  solution.  Here,  we  use  a  parameter  tlat  controls 
the  number  of  terms  summed  in  the  infinite  series  and  which  thereby  Umits  it  convergence  and  accuracy. 
M  other  calculations  are  otherwise  done  in  24-digit  computer  precision.  The  frequency  range  covered 
by  the  results  which  follow  are  for  ka  =  3  to  6  where  ka  is  the  cylinder  circumference  in  wavelengths. 

Shown  in  Fig  3a  are  results  for  the  CN  and  DF,  and  in  Fig.  3b  the  MF,  as  a  function  of  FM  numerator- 
pohnomial  order  with  the  FM  denominator-polynomial  order  a  parameter.  As  found  for  the  pole  spec- 
ttum  above,  the  CN  increases  and  the  DF  decreases  with  N,  with  their  sum  approximately  equal  to  the 
CP.  Similarly,  the  MF  is  found  to  increase  with  N  until,  as  is  seen  for  D  =  7,  the  effect  ot  the  increas¬ 
ing  CN  causes  it  to  decrease  after  reaching  a  peak  value  of  about  8.5  digits. 


Figure  3.  The  results  here  are  similar  to  those  of  Fig.  1  except  that  the  generating  model  is  the  anaVtical,  series  so- 
tirtwn  for  the  current  induced  on  the  illuminated  side  of  an  infinite,  circular  cylinder  by  a  normally.irwident  plane  wave 
and  sampled  between  ka = 3  and  6.  The  lower  condition  numbers  for  this  problem  compared  with  the  pote^enes  ex¬ 
ample  of  Figs.  1  and  2  results  in  more  accurate  agreement  between  the  fitting  and  generating  models  in  (b).  A  com¬ 
puter  precision  of  24  digits  was  used. 

For  the  parametera  used  to  model  this  particular  response,  it  appears  that  using  D  =  7  and  letting  N  ^ 
from  3  to  7  will  yield  the  best  MF  as  the  GM  accuracy  is  varied,  which  leads  to  the  result  shown  m  Fig. 
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4.  The  dependence  of  the  on  the  input-data  accuracy  is  similar  to  that  already  seen  in  Fig.  2.  Of 
most  significance  is  that  there  is  again  a  nearly  one-to-one  correspondence  between  and  in¬ 
dicating  the  potential  utility  of  the  procedure  for  estimating  the  accuracy  of  that  data,  and  hence,  of  the 
GM  that  produced  it  in  the  case  of  CEM.  The  effect  of  decreasing  the  input-data  accuracy  to  4  digits  for 
the  infinite-cylinder  generating  model,  as  shown  in  Fig.  5,  further  demonstrates  the  feasibility  of  deter¬ 
mining  the  generating-model  accuracy  from  its  agreement  with  the  fitting  model. 


(DIGITS) 

Figure  4.  Shown  here  is  the  average  agreement 
achieved  between  a  fitting  model  and  additional  samples 
of  the  EM  generating  model  for  the  example  of  Fig.  3,  as 
a  function  of  the  generating-model  sample  accuracy. 

The  fitting-model  denominator-polynomial  order  is  7  and 
the  numerator-polynomial  order  is  a  parameter.  This  re¬ 
sult  indicates  the  feasibility  of  estimating  the  accuracy  of 
CEM  samples  from  the  agreement  they  exhibit  with  the 
fitting  model. 


Rgure  5.  The  results  here  repeat  those  of  Fig.  3  for  the 
infinite-cylinder  generating  model,  but  the  input-data  ac¬ 
curacy  was  reduced  to  4  digits.  It  can  be  seen  that  if 
the  fiffing-model,  denominator-polynomial  order  is  5  or  6, 
the  fitting-model  agreement  to  the  input  data  is  within 

3x10"^  of  the  input-data  accuracy,  for  numerator- 
polynomial  orders  of  4  or  greater. 


A  more-detailed  demonstration  of  this  possibility  is  illustrated  in  Fig.  6  where  the  frequency  dependence 
of  the  MF  is  shown  with  the  input-data  accuracy  a  parameter.  The  parameter  L  is  the  digit  specification 
for  the  convergence  of  the  series  used  in  computing  the  infinite-cylinder  test  current,  i.e.,  the  series  is 

computed  until  rrj/Sum(i)l  <  10"^  where  Tj  is  the  i’th  term  in  the  series  and  Sum(i)  is  the  series  sum  up 
through  that  terni.  The  individual  Bjj^  for  each  L  are  seen  to  cluster  randomly  about  that  value  of  dig¬ 
its  thus  yielding  a  B^vlF  '"^lue  being  neariy  equal  to  L  as  shown  in  Fig.  4. 


CONCLUDING  COMMENTS 

A  major  deficiency  in  computational  electromagnetics  (CEM)  is  the  almost-universal  absence  of  anv 
cost-effective  way  to  estimate  the  accuracy  of  the  solutions  provided  by  numerical  models.  The  rationale 
for  the  procedure  described  in  this  paper  is  to  exploit  knowledge  of  the  macroscopic  behavior  of  electro¬ 
magnetic  observables  as  a  means  of  measuring  the  uncertainty  of  results  computed  from  CEM  models 
and  thereby  to  estimate  their  accuracy.  The  procedure  is  based  on  the  known,  resonance-like  frequency 
dependence  of  EM  observables,  a  behavior  well-approximated  by  a  rational  function  that  can  thus  serve 
as  a  fitting  model  (FM)  for  such  observables.  By  quantifying  the  FM  using  a  set  of  frequency  samples 
from  the  first-principles  or  generating  model  (GM),  the  computed  match,  or  fit,  between  this  FM  and 
additional  GM  samples  provides  an  accuracy  estimate  of  the  data  produced  by  the  GM.  This  possibility 
has  been  demonstrated  here  for  two  problems,  one  a  pole  spectrum  and  the  other  the  current  excited  on 
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an  infinite  circular  cylinder  by  an  incident  plane  wave. 

It  should  be  appreciated  that  quantifying  the  FM  requires  solution  of  a  “data”  matrix  whose  size  is  deter¬ 
mined  by  the  orders  of  the  numerator  and  denominator  polynomials  that  comprise  it,  and  which,  even 

though  generally  involving  fewer  than  20  equations,  can  have  a  condition  number  exceeding  10^®.  Be¬ 
cause  the  error  estimate  derived  from  the  FM  procedure  can  be  no  better  than  the  fit  achieved  between 
the  FM  and  the  original  GM  data,  the  computer  precision  required  will  generally  exceed  16  digits  to 
overcome  the  effects  of  data-matrix  ill-conditioning.  The  results  presented  here  were  obtained  using  24- 
digit  computer  precision.  It  should  also  be  appreciated  that  the  FM  is  not  unique  and  furthermore,  that 
some  experimentation  is  required  concerning  the  bandwidtii  covered  by  GM  samples  and  the  parameters 
used  for  the  FM.  Finally,  it’s  worth  noting  that  the  procedure  outlined  here  can  not  only  provide  an 
error  estimate  for  the  data  obtained  from  a  GM,  but  it  may  be  possible  to  improve  the  accuracy  of  the 
GM  samples  themselves  by  averaging  the  parameters  of  overlapping,  windowed  FMs.  Both  of  these 
procedures,  of  course,  derive  from  the  fact  the  oversampled  GM  data  is  available  whose  redundancy 
provides  the  additional  information  that  makes  such  operations  possible. 

Figure  6.  The  fitting-model  agreement  for 
each  additfonai  data  sample  for  the  cylinder 
generating  model,  with  the  input-data  accura¬ 
cy  a  parameter  ranging  from  tO’"'  to 
12  (i.e.,  10"*-)  IS  presented  here.  These  results 
^ow  that  “noise"  in  the  input  data  is  accu- 
®  rately  estimated  by  the  fitting  model.  The 
®  sam^  numbers  are  roughly  proportional  to 
5  frequency  over  a  range  of  cylinder  circumfer¬ 
ence  from  3  to  6  in  waveforms. 
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Abstract 

In  this  paper,  we  consider  with  particular  emphasis  to  error  and  execution  time,  the  imple¬ 
mentation  of  a  version  of  the  fast  multipole  method  for  scattering  by  large  objects.  In  contrast 
to  the  traditional  moment  method,  the  fast  multipole  method  (FMM)  has  CPU  re¬ 

quirement  per  incidence  angle.  This  substantially  reduced  CPU  time  is  achieved  by  subdividing 
the  far  zone  elements  into  groups  whose  weighted  contribution  is  then  interacted  with  the  test 
element.  The  size  of  the  groups  and  the  various  approximations  used  in  the  interaction  of  the 
groups  play  an  important  role  on  the  solution  accuracy,  but  so  far  the  effect  of  these  parameters 
on  the  solution  accuracy  has  not  been  considered.  Guidelines  are  given  for  choosing  the  various 
parameters  affecting  the  speed  and  solution  accuracy  of  the  method  such  as  the  near-group 
distance  and  sampling  rate  based  on  an  error  criterion.  Our  study  is  carried  out  with  reference 
to  a  version  of  the  FMM  referred  to  as  the  Fast  Far  Field  Approximation. 


1  Introduction 

Integral  equation  methods  and  in  particular  the  moment  method  were  introduced  first  in  the  1960s 
[1].  In  the  ensuing  years  the  moment  method  achieved  spectacular  success  and  was  applied  to 
model  a  wide  variety  of  complicated  electromagnetic  phenomena.  Among  its  chief  attractions  were 
the  exactness  of  the  solution  technique  and  the  ability  to  model  complicated  geometries  w'ith  ease. 
However,  these  advantages  come  at  a  price.  The  price  to  be  paid  for  the  exactness  of  the  solution 
was  the  full  matrix  systems,  a  necessary  part  of  the  technique.  Since  full  matrices  need  O(iV^) 
storage,  the  MoM  suffers  severe  problems  of  scalability,  especially  for  3D  applications  and  large 
2D  problems.  Recently,  researchers  have  been  trying  to  develop  IE  techniques  w'hich  make  the 
matrix  system  sparse  by  using  special  basis  functions  [2]  or  by  making  use  of  filtering  techniques 
to  isolate  the  dominant  elements  of  the  moment-method  matrix  [3].  Another  technique  relies  on 
clumping  the  matrix  elements  in  the  far-held  [4], [5], [6].  The  technique  which  we  have  adopted  is 
based  on  the  latter  principle  and  was  introduced  in  [7].  This  technique  reduces  the  operation  count 
in  the  iterative  solution  of  moment  method  problems  from  O(iV^)  to  for  large  N  and  is 

based  on  computing  the  interactions  between  different  elements  using  different  schemes  depending 
on  the  electrical  distance  between  the  elements.  For  small  interaction  distances,  the  exact  kernel 
is  employed  while  for  large  distances  an  approximate  kernel  is  used.  The  memory  requirement, 
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with  the  preclusion  of  matrix  factorization,  is  reduced  to  0{N),  thus  enabling  the  solution  of  larger 
problems. 


2  Study  Objectives 

Although  it  has  already  beeu  demoustrated  that  the  fast  multipole  method  has  ‘>”™" 

lower  computational  complexity  [5],  there  are  several  parameters  which  play  a  role  m  the  CPU 
requirements  and  accuracy  of  the  solution.  More  speciiicaUy,  the  groupmg  scheme  the  samphng 
rate  and  the  group  size,  all  have  an  effect  on  the  performance  of  the  FMM.  In  this  « 

examine  the  role  of  these  parameters  on  the  accuracy  and  computational  efficiency  of  the  F  i 
on  the  basis  of  some  error-criterion.  SpeciiicaUy,  we  considered  the  scattering  by  two  dimensional 
metaUic  structures  using  a  solution  of  the  EFIE  and  the  MFIE  with  pulse  basis  and  point  ma  ch  ^ 
[81  In  our  study,  the  Fast  Far  Field  Algorithm  (FAFFA)  [7],  a  variation  of  the  FMM  [6]  was  used 
for  carrying  out  the  matrix-vector  products.  Referring  to  Figure  1,  the  foUowing  parameters  were 


Group 


Figure  1;  The  process  of  grouping  unknowns  -  two  groups  are  in  the  near  field  of  each  other  if  the 
distance  between  their  centers  pi/i  is  less  than  dmin 


examined  with  respect  to  the  accuracy  and  efficiency  of  the  FAFFA: 

•  the  near-group  radius  ^dmin 

•  the  sampling  rate 

•  group  size,  and 

•  memory  requirements 

Our  benchmark  for  accuracy  was  the  RMS  error  given  by 


ERRORrms  = 


1  ^ 

^  J^El^CSREFii) 


-  RC Spri^f 


(1) 
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where  RCSref  denotes  the  reference  radar  cross  sections  as  computed  by  the  standard  moment 
method  approach  without  grouping,  RCSff  is  the  RCS  calculated  using  FAFFA  and  M  being  the 
number  of  points  at  which  the  RCS  is  computed.  We  note  that  this  error  formula  is  among  several 
that  have  been  considered,  but  has  been  found  to  represent  a  reasonable  measure  of  the  accuracy. 
It  has  been  employed  by  Schuh  and  Woo  [9]  for  a  study  on  the  accuracy  of  RCS  computations 
by  various  codes.  In  the  following  we  wiU  show  error  curves  as  a  function  of  the  aforementioned 
parameters  for  two  cylinders.  One  is  a  PEC  isosceles  triangle  having  a  base  2.5A  in  length  and 
a  height  of  17. 95 A.  The  other  is  a  rectangular  PEC  cylinder  25A  x  4A  in  size.  These  geometries 
(see  Figure  2)  are  analyzed  at  grazing  (0  degrees)  incidence  since  this  excitation  is  most  stressing 


y 

25X  ^ 

(b) 

Figure  2:  (a)  An  isosceles  triangular  metallic  cylinder  -  17.95A  high  with  a  2.5A  base  (b)  A  rectan¬ 
gular  metallic  cylinder  -  25A  x  4A 

for  RCS  calculations,  particularly  so  for  the  triangular  cylinder.  In  fact,  the  two  geometries  were 
selected  to  have  different  characteristics  in  the  backscatter  region. 

2.1  RMS  Error  as  a  function  of  near-group  separation  distance  and  sampling  rate 

First  we  look  at  the  RMS  error  of  the  bistatic  RCS  pattern  as  a  function  of  the  near-group  radius, 
drain-  Such  error  curves  are  shown  in  Figures  3  and  4  for  the  isosceles  and  rectangular  cylinders, 
respectively.  These  figures  depicts  several  curves,  each  corresponding  to  a  different  sampling  rate 
(A/10,  A/15,  A/20,  A/30  or  A/40  as  designated  in  the  figure).  Surprisingly,  the  error  curves  for 
each  geometry  give  rise  to  the  same  conclusion.  Basically,  Figures  3  and  4  demonstrate  that  the 
sampling  rate  has  a  profound  effect  on  the  accuracy  of  the  solution  and  the  value  of  the  error  is 
strongly  dependent  on  the  near-group  radius.  For  example,  if  dmin  is  set  to  1.7A,  a  tessellation  rate 
of  10  segments  per  wavelength  yields  an  RMS  error  of  4.79  dB  whereas  a  tessellation  rate  of  30 
segments  per  wavelength  leads  to  an  RMS  error  down  to  1.19  dB.  As  expected,  a  higher  sampling 
rate  leads  to  smaller  errors.  However,  these  errors  are  different  and  much  larger  than  those  resulting 
from  a  standard  moment  method  implementation.  Specifically,  a  tessellation  rate  of  10  segments 
per  wavelength  yields  an  RMS  error  of  0.1615  dB  when  employing  the  standard  unreduced  moment 
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Figure  3:  Error  curves  tor  the  FAFFA  for  a  isosceles  triangular  metallic  cyUnder  ■  17.95A  high  with 
a  2.5A  base 

2.2  CPU  requirements  .  j  r 

Returning  now  to  Figures  3  and  “i  "e  ‘alw“te  tnJ 

the  triangular  “gall,  eylind^^  with  the  same  sampling  rate.  Unfortunately, 

d„,.„  must  be  at  least  ^  f  ,be  value  often  suggested  by  other  users 

these  values  for  d^in  are  much  larger  \  important  to  keep  the  near-group 

of  the  fast  multipole 

L"  fo  S  s“gm“  1  per  wavelength  (for  the  isosceles  triangle)  if  we  are  .0  reduce  d„i„  .0  2A, 
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but  clearly  the  increased  sampling  is  counter  to  the  benefits  from  a  smaller  near-group  distance. 
Thus  in  Figures  6  and  7,  we  show  curves  of  the  CPU  time  as  a  function  of  the  near-group  distance 
for  different  sampling  rates.  Figure  6  refers  to  the  triangular  cylinder  whereas  Figure  7  corresponds 
to  the  rectangular  cylinder,  and  both  show  a  similar  dependence  on  dmin  and  the  sampling  rate. 
For  both  cylinders  the  CPU  time  increases  almost  linearly  with  dmin  and  is,  of  course,  higher  for 
denser  tessellations.  These  curves  can  be  used  in  conjunction  with  those  in  Figures  3  and  4  to 
obtain  the  CPU  time  to  determine  dmin  for  a  given  RMS  error  and  sampling  rate.  Alternatively, 
Figures  3  and  4  can  be  consolidated  with  the  data  in  Figures  6  and  7  to  obtain  a  new  set  of  curves 
which  explicitly  give  the  CPU  time  as  a  function  of  a  desired/given  RMS  error  for  the  selected 
sampling  rate.  These  CPU  curves  as  a  function  of  a  specified  RMS  error  are  shown  in  Figures  8 
and  9,  corresponding  to  the  triangular  and  rectangular  cylinders,  respectively.  The  trends  in  each 
of  these  Figures  are  identical,  demonstrating  that  the  dependence  between  CPU  time,  sampling, 
near  group-distance  and  RMS  error  are  likely  the  same  regardless  of  geometry  considerations. 

Basically,  Figures  8  and  9  show  that  the  CPU  time  increases  quadratically  with  sampling 
rate.  Clearly,  a  lower  sampling  rate  is  more  attractive  in  terms  of  CPU  time  since  it  leads  to 
smaller  systems  and  therefore  faster  convergence  rate.  That  is,  it  is  better  to  use  a  larger  dmin 
with  lower  sampling  rates  rather  than  a  higher  sampling  rate  with  a  lower  near-group  window 
radius.  For  example,  to  achieve  an  RMS  error  of  1  dB  for  the  triangular  cylinder,  the  CPU  time 
is  about  10  sec  when  using  a  tessellation  rate  of  10  segments  per  wavelength  and  from  Figure  3 
the  corresponding  dmin  is  3.75A.  However,  to  achieve  the  same  error  using  a  tessellation  rate  of 
30  segments  per  wavelength,  the  corresponding  CPU  time  is  more  than  five  times  higher  but  the 
required  near-group  window  radius  is  less  than  2.5  wavelengths.  It  may  seem  that  the  higher  number 
of  degrees  of  freedom  associated  with  higher  segmentation  rates  leads  to  unequivocally  higher 
memory  requirements  as  well.  However,  this  is  not  always  the  case  with  the  FAFFA  implementation 
because  the  value  of  dmin  plays  a  major  role  on  the  bandwidth  of  the  dense  section  of  the  matrix. 
For  example,  in  the  case  of  the  rectangular  cylinder,  sampling  at  A/ 10  would  imply  580  unknowns 
while  sampling  at  A/30  would  imply  1740  unknowns.  To  achieve  a  RMS  error  of  3.75  dB,  it  would 
require  a  near  group  window  of  3.5A  for  sampling  at  A/10  and  a  window  of  lA  for  sampling  at  A/30. 
Execution  time  considerations  are  of  the  order  of  20  seconds  for  the  first  case  and  75  seconds  for 
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Observation  angle  (])  (deg.) 

Figure  5:  Comparison  of  bistatic  patterns  of  an  isosceles  triangular  pec  cyUnder  -  17.95A  high  with 
a  2.5A  base  computed  using  the  FAFFA  at  different  sampling  rates 


the  latter  case.  An  examination  of  the  grouping  of  the  unknowns  would  reveal  that  the  memory 
requirement  for  the  latter  case  is  smaUer.  Sampling  at  A/30  and  employing  a  near  group  radius  of 
lA  would  mean  a  storage  requirement  of  -  42  elements  per  row.  SampUng  at  A/10  and  employing 
a  near  group  radius  of  3.5A  would  mean  a  storage  requirement  of  -  72  elements  per  row,  which 
is  higher  than  the  previous  case.  Consequently,  although  Figures  8  and  9  suggest  that  a  lower 
sampling  rate  has  the  least  CPU  requirements  in  satisfying  a  given  error  criterion,  the  memory 
requirements  will  be  higher  when  compared  to  those  associated  with  a  higher  sampling  error  an 
the  same  error  criterion.  Memory  considerations  wiU  of  course  be  an  issue  for  very  large  simulations 
and  the  choice  of  using  a  high  sampling  or  not  wiU  depend  on  the  available  computing  resources. 


Radius  for  near-group  treatment,  (X) 

Figure  6:  Time  curves  for  the  FAFFA  for  a  isosceles  triangular  pec  cyUnder  -  17.95A  high  with  a 
2.5A  base 
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Radius  for  near-group  treatment,  d^^,,  (X) 


Figure  7:  Time  curves  for  the  FAFFA  for  a  rectangular  pec  cylinder  -  25A  x  4A 


Figure  8:  Time  -  error  curves  for  the  FAFFA  for  a  isosceles  triangular  pec  cylinder  -  17.95A  high 
with  a  2.5A  base 


3  Conclusion 

In  this  paper  we  looked  at  the  CPU  requirements  of  the  FAFFA  as  a  function  of  the  different 
parameters  affecting  its  performance,  including  the  near-group  window  radius,  sampling  rate  and 
error.  We  presented  curves  which  show  the  required  sampling  rates,  near-group  window  radius  and 
CPU  time  as  a  function  of  a  given  error  criterion.  Based  on  these  curves,  one  concludes  that 

•  The  error  is  strongly  dependent  on  dmin  and  the  sampling  rate.  This  error  dependence 
on  the  sampling  is  particularly  inherent  to  the  FAFFA  solution  method  and  should  not  be 
confused  with  similar  errors  associated  with  the  unreduced  moment  method  implementations. 
With  higher  sampling,  the  spatial  extent  of  each  group  is  reduced  and  thus  the  process  of 
aggregation  and  disaggregation  of  group  elements  into  group  centers  induces  less  error. 

•  In  lieu  of  CPU  time  efficiency,  it  is  best  to  employ  lower  sampling  and  larger  near-group 
window  radius. 
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Figure  9:  Time  -  error  curves  for  the  FAFFA  for  a  rectangular  pec  cylinder  -  2oA  X  4A 

•  Stringent  memory  requirements  behooves  the  adoption  of  high  sampling  and  small  near  group 
radius  but  at  the  expense  of  high  execution  time. 
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Abstract 

The  covolume  approach  for  solving  Maxwell’s  equations  is  the  most  natural  generalization  to  tetrahedral 
meshes  of  the  FDTD  method  for  rectangular  meshes.  In  this  paper  a  summary  of  the  available  error  estimates 
is  given.  These  estimates  show  first  order  accuracy  for  three  dimensional  tetrahedral  meshes.  For  rectangular 
graded  meshes  the  estimates  show  second  order  accuracy. 


1  Introduction 

Staggered  mesh  schemes  for  the  numerical  solution  of  Maxwell’s  equations  go  back  as  far  as  [19].  Over  the  years 
this  scheme  has  seen  a  number  of  generalizations  intended  to  enhance  its  usefulness.  Mostly,  these  generalizations 
are  aimed  at  increasing  the  geometric  complexity  that  can  be  handled.  Thus,  in  [3]  a  tensor  formulation  was  given. 
This  permits  the  method  to  be  extended  from  rectangular  meshes  to  meshes  defined  by  curvilinear  coordinates. 
Two  dimensional  generalizations  using  quadrilateral  meshes  are  given  in  [6]  and  [8]. 

For  three  dimensions,  hexahedral  mesh  formulations  were  proposed  in  [7]  and  [9].  These  formulations  use 
interpolation  to  obtain  values  for  nonbasic  field  components-those  which,  in  the  rectangular  situation,  lie  along 
the  primal  or  dual  mesh  edges.  Another  approach  is  the  “control  path”  method  of  [4],  [5].  The  control  path 
method  uses  the  classical  finite  difference  approach  of  modifying  the  finite  difference  stencil  near  the  boundary  of 
the  domain. 

A  natural  generalization  of  the  standard  staggered  mesh  scheme  to  tetrahedral  meshes  was  given  in  [10].  A 
similar  technique  was  used  independently  in  [14]  to  solve  the  Navier-Stokes  equations.  This  “covolume”  approach 
does  not  use  any  interpolations  and  is  a  very  natural  way  to  generalize  the  original  rectangular  staggered  mesh 
approach.  A  characteristic  feature  of  covolume  schemes  is  the  use  of  Voronoi-Delaunay  mesh  pairs  to  replace  the 
rectangular  staggered  mesh  arrangement. 

Very  little  rigorous  analysis  of  staggered  mesh  schemes  for  electromagnetics  is  available.  We  know  only  of  the 
rectangular  mesh  analysis  of  [11]  and  [12].  (For  the  incompressible  Navier-Stokes  equations  results  may  be  found 
in  [14],  [15]  and  [17].)  In  this  note  we  will  survey  some  recent  results  we  have  obtained  about  the  convergence  of 
the  covolume  algorithm,  and  discuss  the  corresponding  results  for  the  rectangular  mesh  (Yee)  scheme. 

We  begin  with  a  section  on  Voronoi-Delaunay  meshes.  Then  we  cover  the  general  error  estimate  for  the 
semidiscrete  (i.e.  continuous  time,  discrete  space)  approximation.  This  is  followed  by  results  for  the  classical 
“leapfrog”  time  discretization.  Finally,  we  show  that  for  graded  rectangular  meshes  the  accuracy  is  one  higher 
order  than  for  the  general  tetrahedral  case.  Proofs  of  the  results  are  given  in  our  paper  [16]. 

2  Mesh  Notations  and  Discrete  Vector  Fields 

Assume  that  the  polyhedral  domain  fi  has  a  primal  family  of  finite  element  style  tetrahedral  partitions,  param¬ 
eterized  by  the  maximum  side  length  which  is  generically  denoted  by  h.  We  will  assume  that  the  ratio  of  radii 

iThis  work  was  supported  by  the  US  Air  Force  Office  of  Scientific  Research  under  Grant  F-49620-94-0311. 
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of  circumscribing  spheres  and  inscribed  spheres  of  all  the  individual  tetrahedra  axe  uniformly  bounded  above  and 
below  as  h  approaches  0.  A  dual  mesh  is  formed  by  connecting  adjacent  tetrahedral  circumcenters  and,  in  the  case 
of  tetrahedron  with  a  face  on  a  boundary,  by  connecting  their  circumcenters  with  those  of  their  boundary  fa.ces. 
By  elementary  geometry  these  dual  edges  are  perpendicular  to  the  associated  tetrahedral  faces.  These  connections 
also  form  the  edges  of  a  set  of  polyhedra.  It  follows  from  elementary  geometry  that  the  edges  of  tetrahedra  are 
perpendicular  to  and  in  one-to-one  correspondence  with  the  faces  of  dual  polyhedra  or  “covolumes” .  The  reciprocal 
orthogonality  between  edges  cind  faces  is  the  key  to  the  results  which  follow. 

The  N  nodes  of  the  tetrahedral  mesh  are  assumed  to  be  numbered  sequentially  in  some  convenient  way,  and 
likewise  the  T  nodes  of  dual  mesh.  Similarly  the  F  faces  (edges)  and  M  edges  (faces)  of  the  primal  (dual)  mesh 
are  sequentially  numbered.  The  individual  tetrahedra,  faces,  edges  and  nodes  of  the  primal  mesh  are  denoted  by 
Ti,  Kj,  Ok  and  Vi  respectively.  Those  of  the  dual  mesh  are  denoted  by  primed  quantities  such  as  cr'..  A  direction 
is  assigned  to  each  primal  edge  by  the  rule  that  the  positive  direction  is  from  low  to  high  node  number.  The  dual 
edges  are  directed  by  the  corresponding  rule.  We  also  denote  by  Fi  the  number  of  tetrahedral  interior  faces  (or 
dual  edges)  and  by  Mi  the  number  of  tetrahedral  interior  edges  (or  dual  faces). 


Figure  1 

Let  Sj  denote  the  area  of  Kj  and  h'j  the  length  of  tr'-  In  where  Fi  denotes  the  number  of  interior  primal 


faces,  we  will  introduce  the  inner  product  (•,  •)w'  defined  by 

{u,v)w  •=  ^2  ~  {Su,D'v)  —  (D'ujSv)  (2.1) 

and  denote  the  resulting  inner  product  space  by  W  and  the  associated  norm  by 

Ijulliv  :=  (2.2) 

In  (2.1),  (•,■)  denotes  the  standard  Euclidean  inner  product,  S  :=  diag(Sj),  D'  :=  diag(h^)  and  W  :=  SD' 
are  Fi  x  Fi  invertible  diagonal  matrices.  The  norm  defined  by  (2.2)  is  clearly  three  times  a  discrete  norm. 
Similarly,  we  will  introduce  an  inner  product  in  where  M  denotes  the  number  of  primal  edges  : 

(u,  v)w'  ■■=  X) 

and  denote  the  inner  product  space  by  W.  The  associated  norm  is 

j|u|!w^/ :=  (u, «)^,.  (2.4) 
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The  notations  in(2.3)  and  (2.4)  correspond  to  those  of  (2.1)  and  (2.2).  For  example,  s’j  denotes  the  area  of  dual 
face  Kj. 

For  each  primal  face  a  discrete  circulation  is  defined  by 

:=  (2-5) 

(TjedKi 

Similarly,  for  each  interior  covolume  face  k'^  the  discrete  circulation  is 

(C'u)^.  :=  E  (2-6) 

A  tilde  on  hj  or  h'^  means  that  the  quantity  is  to  be  taken  with  a  negative  sign  if  the  dual  edge  is  directed  against 
the  positive  sense  of  description  of  dni  or  respectively,  and  with  a  positive  sign  otherwise. 

For  each  tetrahedron  r;  a  discrete  flux  is  defined  by 

By  Sj  we  mean  sj  negatively  signed  if  the  corresponding  velocity  component  is  directed  towards  the  inside  of 
and  positively  signed  otherwise. 

Using  these  definitions  we  are  able  to  prove  discrete  analogs  of  formulas 

[  curl  E  ■  H  dx  =  f  curl  H  •  E  dx 
Jq  Jq 

which  holds  when  E x n  =  0  on  F,  and  /  u  •  grad <i>dx=  I  (div  u)  4> dx  for  (fl), u  €  /f(div;  fl) ,  respectively, 

JQ  Jq 

and  also  of  the  identity  div(curl  u)  =  0. 

See  [13]  and  [18]  for  more  details  on  this  aspect  of  covolume  discretizations. 


3  Semi-Discrete  Maxwell’s  Equations 

We  introduce,  for  general  field  A,  its  “face  averages”  Af  e  and  its  “edge  averages  ”  Ag  G  as  follows  (refer 
to  Figure  1): 

Af  :=  —  /  A  •  nds, 

J  k; 

A'f-.^-^J  A -nds, 

Ag  ^ J  A-tda, 

"■jJ  cr'. 

where  n  and  t  denote  the  unit  normal  vector  to  the  face  Kj  (or  «:[)  and  the  unit  tangent  vector  to  the  edge  aj  (or 
a'j),  respectively.  Error  functions  for  primal  edges  and  faces  will  be  denoted  by 

€A:=A-Af, 

T]A  :=A~Ae, 

~  Af  ~  Ag. 

Error  functions  for  dual  edges  and  faces  are  defined  similarly. 
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As  shown  in  Figure  1,  for  each  tetrahedron  n  we  will  use  the  normal  components  of  the  magnetic  field  H  to 
its  faces  and  the  tangential  components  of  the  electric  field  £}  in  the  directions  of  its  edges. 

Throughout  this  paper  we  will  consider  the  solution  of  Maxwell’s  curl  equations  in  a  bounded  domain  fl  with 
the  boundary  condition  E  x  nir  =  0. 

Now  integrate  both  sides  of  Ampere’s  law, 


c 


dt 


-curlH  =  J, 


over  the  co-face  «'■  to  obtain 

j  J{x,t)dx.  (3.1) 

Here,  {Ef),  denotes  the  average  of  E  •  nj  over  the  face  k'^  where  the  %  is  the  unit  vector  in  the  direction  of  aj, 
and  (C"i7e)ic'  is  the  discrete  circulation  around  the  face  Similarly,  from  Faraday’s  law 

-f  curlE  =  0, 
dt 

=  0  (3.2) 

dt 

where  denotes  the  average  of  H  •  nj  where  Uj  is  the  unit  normal  to  the  face  Ki  and  {CEe),^  is  the  discrete 

circulation  around  the  face  Kj. 

Let  E  and  H  denote  vectors  of  components  in  and  respectively.  Then  (3.1)  and  (3.2)  (which  are 
exact)  suggest  the  approximations  (where  it  is  implied  that  components  of  E  associated  with  boundary  edges  are 
zero,  i.e.  E\r  =  0) 

=  (3.3) 

at 

fiS^  +  CE  =  0  (3.4) 

dt 

where  J  G  Since  fiS  and  eS'  are  invertible,  (3.3)-(3.4)  is  a  system  of  linear  ordinary  differential  equations, 
and  the  existence  and  uniqueness  of  a  solutions  follow  from  well  known  results. 

Using  (3.4)  we  can  show  that 

|(M)  =  0. 

This  shows  the  sense  in  which  -^divE  =  0  is  satisfied  at  the  discrete  level  in  the  covolume  scheme. 
dt 

The  basic  estimate  for  (3.3)-(3.4)  is  then: 

Theorem  1  Denote  by  {E,H)  the  solution  of  (S.S)  and  (3.4)  and  by 

(E,H)  G  H"^’Ho,r;(lF^’'’(0))^)'* 

the  solution  of  Maxwell’s  equations  with  p  >  2.  Then  we  have  the  estimate 

^m^(€ll(E  -  Ee)(t)l|iv<  -f  -  ^fe)Wlltv)  <  Rh(jjE||ii(o.Ti(Wi.P(n))3)  +  I1H1|i,i(0.T;(IV1.p(O))S)). 

Thus,  the  basic  scheme  is  first  order  accuracy. 


4  The  Fully  Discrete  problem 

There  are  many  possible  time  stepping  methods  that  can  be  applied  to  (3.3)  and  (3.4).  We  will  discuss  a  leapfrog 
scheme  which  is  very  popular  in  computational  electromagnetics  (see  [19]).  In  this  scheme  we  approximate  E(f)  at 
times  t„  =  nAt,  0<n<oo  with  a  vector  H(i)  at  time  t^+x  with  a  vector  •  The  initial 
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value  Ul  can  be  computed  using,  for  example,  a  Taylor  expansion.  Given  the  next  approximation 

is  obtained  by  solving  the  equations 


e5'(5"+i  _£«)  _  =  J"+2 

+  AtC£"+^  =  0 


where 


-L 


(n+l)Ae 

Jdt. 

nAt 


(4.1),  (4.2)  is  an  explicit  scheme,  so  the  existence  and  uniqueness  of  a  solution  are  apparent, 
functions  defined  last  section,  we  can  rewrite  (4.1)-(4.2)  as 


(4.1) 

(4.2) 


Using  the  error 


-  €e)  =  {At)C'ri^^  +  G",  (4.3) 

-  4+^ )  =  -(At)G77i^+'  +  G"„.  (4.4) 

By  a  direct  computation,  G"  and  G"  are  given  by 

G”  =  -  Ef)  +  AtC’He'^^,  (4.5) 

G"  =  -  F;+^)  -  AtCEp+K  (4.6) 


For  the  fully  discrete  scheme  we  expect  a  stability  condition  to  appear  in  the  error  estimate  and  it  does  so. 
Theorem  2  let  denote  the  solution  of  (4.1)-(4.2),  and  let 


(E,H)  G  (i?^(0,T;(lU^’J’(fi))3))2 


denote  the  solution  of  Maxwell’s  equations  with  p>2.  Under  the  stability  condition 


cAt  < 


min(hy) 


(4.7) 


where  c  =  (e/z)“2  is  the  speed  of  the  light  in  the  medium,  M2  is  the  maximum  of  the  ratios  of  the  maximum  to 
minimum  side-lengths  over  the  union  of  adjacent  tetrahedra  and  M3  is  the  maximum  number  of  edges  over  all 
co-faces,  we  have  the  following  error  estimate  for  the  fully  discrete  scheme  (4.1)-(4.2) 


^1\\W'  -F  ^  -  He  Nltv)  <  7i'/l(||Ej|/fi(o,T;(W/l-P(n))3)  +  ||H|j^i(0,T;(H^i.p(n))3)).  (4.8) 


Once  again  the  error  is  first  order  in  h  on  tetrahedral  mesh. 


5  On  Rectangular  Meshes 

An  example  of  a  rectangular  mesh  in  three  dimensions  is  one  obtained  by  subdividing  a  rectangular  parallelepiped 
by  planes  parallel  to  its  faces.  E  the  planes  are  equally  spaced  along  each  of  the  directions  the  mesh  is  “uniform”.  E 
the  spacings  vary  from  point  to  point  the  mesh  is  “graded”.  For  the  graded  meshes  both  the  primal  and  dual  mesh 
faces  are  rectangular  and  are  mutually  perpendicular.  The  duality  relations  discussed  in  section  2  are  preserved. 
The  resulting  discretization  of  Maxwell’s  equations  is  the  standard  Yee  scheme  [19]. 

For  the  semidiscrete  equations  there  is  the  following  estimate 

Theorem  3  Suppose  that  (E,H)  6  L^(0,r;  (H®(n))®)2  satisfies  Maxwell’s  equations,  and  denote  by  (E,H)  the 
solution  of  (3.3)-(3.4)  on  graded  meshes  with  maximum  grid  size  h.  Then 

~ ^  ■^^^lK®)H)||(ii(o,r;(H3(n))s))2-  (5-1) 

Here  K  depends  on  the  maximum  of  the  ratios  of  the  maximum  to  minimum  side-lengths  in  each  graded  cell. 
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A  similar  estimate  holds  for  the  fully  discrete  scheme  under  a  stability  condition. 

The  rectangular  scheme  is  therefore  second  order  accurate  in  h.  This  result  implies  that  the  rectangular 
approximation  is  inherently  more  accurate  than  the  tetrahedral  approximation,  the  reasoning  behind  this  statement 
is  that  second  order  accuracy  for  tetrahedral  meshes  can  be  obtained  only  for  very  slowly  varying  meshes.  (See 

[14]  for  the  two  dimensional  case)  For  the  rectangular  case,  however,  there  is  no  requirement  slow  variation  in  the 
mesh  grading.  We  do  not  know  at  present  how  to  reconcile  these  situations  although  presumably,  local  symmetry 
considerations  are  important. 
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Abstract 

the  .heory  a,  i.  th=  cod,  CARDS. 

1  Introduction 

this  paper,  »e  describe  recent  progress  on 

of  reaction-diffusion  equations.  We  of  a  general 

adlptetTcL”^^  Adaptive  Reaction-Diilnsion  Solver,  fo,  the  nunterical  solution 

of  reaction-diffusion  equations 

Uf-V.(DVu)  =  F(u), 

posed  on  a  domain  that  is  a  product  of  a  time  .Hh  ^ ^  Is 

2.  or  3.  Here,  u  represents  a  k„d  time  and  D  is  a  semi-positive 

the  diffusion  term.  F  is  a  vector- valued  fun  specified  at  the  beginning  of 

definite  m  x  m  on  subsets  of  the  boundary  of  as  required, 

the  time  interval,  and  by  Dinchlet  or  .v.terns  of  reaction-diffusion  equations;  well-known  applications 

ncc::  f^J  e^r;t:  .XX"'Srs  d;a.n.cs.  Xbe  interaction  between 
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the  nonlinear  reaction  and  the  diffusion  gives  rise  to  interesting  behavior  such  as  finite  time  blowup,  long  time 
behavior  such  as  metastability,  and  localized  behavior  such  as  fast  transients,  steep  fronts,  and  pattern  formation. 
The  variety  of  applications  and  interesting  solutions  have  excited  much  interest,  but  the  nonlinear  nature  of  these 
problems  makes  the  mathematical  analysis  of  solutions  exceedingly  difficult.  For  this  reason,  we  turn  to  numerical 
analysis  to  get  information  about  the  solutions.  Yet,  this  poses  a  new  set  of  difficulties  because  the  nonlinear 
nature  of  the  problems  has  also  deterred  precise  mathematical  analysis  of  numerical  methods  for  reaction-diffusion 
problems.  In  fact,  most  of  the  numerical  results  in  the  literature  are  missing  even  a  rough  quantitative  estimate 
of  the  size  of  the  error.  This  is  particularly  troublesome  considering  that  so  little  is  known  about  the  solutions 
themselves,  increasing  the  reliance  on  numerical  methods  “working’'  as  desired.  Moreover,  the  fact  that  solutions 
typically  exhibit  several  scales  in  their  space  and  time  behavior  make  accurate  representation  difficult. 

Me  attempt  to  deal  with  these  problems  by  using  adaptive  finite  element  methods  with  error  control  based 
on  feedback  from  the  computation.  The  error  control  rests  on  a  recently  developed  theory  of  a  posieriori  error 
analysis  in  which  the  error  is  bounded  by  computable  or  approximatible  quantities  that  depend  on  the  numerical 
solution  rather  than  the  unknown  true  solution.  The  error  estimators  indicate  the  proper  choice  of  resolution  to 
achieve  the  required  accuracy,  while  the  use  of  adaptive  meshing  enables  the  computational  resources  to  be  used 
where  needed  in  order  to  be  efficient.  The  theory  has  been  worked  out  completely  for  linear  parabolic  problems 
[3]:  there  has  been  much  progress  on  systems  of  nonlinear  ordinary  differential  equations  of  fixed  dimension  [5]; 
and  also  some  work  on  nonlinear  parabolic  problems  [4].  We  have  recently  developed  the  theory  further  in  (8]  and 
we  describe  some  of  the  work  in  this  paper.  There  is  much  remaining  to  be  understood,  and  we  describe  some  of 
these  issues  as  well. 

The  main  deterrence  to  the  widespread  use  of  adaptive  methods  in  differential  equations  is  the  computational 
complexity  reqnired  to  code  them.  We  believe  that  it  is  possible  to  reduce  the  implementation  advantage  of 
simple  explicit  methods  over  adaptive  methods  for  the  genera!  user  because  many  ingredients  of  adaptive  codes 
for  differentia]  equations  are  common  to  all  sorts  of  problems.  Making  these  ingredients  accessible  in  a  general 
way  reduces  the  overhead  of  implementation.  We  describe  the  issues  of  implementation  of  adaptive  codes  on 
parallel  computers  in  detail  in  [13],  [12],  [11],  and  [10]. 

2  The  Bistable  Equation 

To  simplify  the  presentation  of  the  theory,  we  concentrate  on  the  bisiabte  problem, 

=  0<.r<l,0<f,  (2) 

with  f  >  0  small  and  Neumann  or  Dirichlet  boundary  conditions,  which  models  for  example  the  motion  of 
domain  walls  in  ferromagnetic  materials.  The  bistable  equation  is  one  of  the  simplest  problems  that  produce 
nonlinear  relaxation  to  equilibrium  in  tlie  presence  of  competing  stable  steady  states.  The  stable  steady  states 
are  u  =  1  and  u  =  -1,  which  are  minimizers  of  the  energy.  For  generic  initial  data,  approaches  one  of 

these  steady  states  in  the  limit  of  large  time.  But,  this  convergence  can  be  extremely  slow  because  solutions  of 
(2)  can  exhibit  dynamic  metaslahHiiy.  In  general,  v  forms  a  pattern  of  transition  layers  between  the  values  1  and 
-1,  where  the  layer  thickness  is  of  order  e.  The  subsequent  time  scale  for  substantial  horizontal  motion  of  the 
layers  is  exp(Cd/f),  where  C  =  0(1)  and  d  is  the  minimum  distance  between  layers  or  between  the  layers  and  the 
boundaries.  Metastable  solutions  are  not  local  minimizers  of  the  energy,  and  thus  are  always  dynamic.  After  a 
metastable  period,  one  or  more  of  the  layers  disappear  in  a  relatively  quick  transient  and  the  system  forms  a  new 
metastable  pattern.  This  repeats  until  the  eventual  convergence  to  a  steady  state.  We  illustrate  this  behavior 
in  Fig.  la,  where  we  plot  the  evolution  from  an  nearly  nietastable  initial  condition  consisting  of  two  “well.?" 
of  different  thicknesses  with  e  =  0.03  In  Fig.  la,  we  plot  the  evolution  of  the  approximation  from  this  initial 
condition.  The  left  well  is  slightly  thinner  than  the  right  and  collapses  around  time  41,  while  the  well  on  the  right 
collapses  at  time  141.  The  solution  exhibits  metastability  during  the  time  before  41  and  between  the  two  times. 
In  [6],  it  was  shown  that  the  time  scale  for  the  evolution  of  a  numerical  approximation  of  a  metastable  solution 
depends  critically  on  the  accuracy  of  the  space  discretization,  as  well  as  the  time  discretization.  In  particular, 
inaccurate  discretization  causes  the  approximate  metastable  pattern  to  become  stable,  whereas  a  sufficiently 
accurate  approximation  evolves  on  the  same  time  scale  as  the  true  solution.  In  a  similar  vein  in  [9],  we  report 
on  computations  made  on  a  system  of  reaction-diffusion  equations  modelling  shear  flow  in  a  Newtonian  fluid 
with  temperature  dependent  viscosity  that  illustrate  the  bad  effect  on  the  long  time  evolution  of  approximations 
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Figure  1:  (a)  Evolution  of  metastable  solution  computed  with  a  relative  accuracy 
stability  factor. 


(b)  The  associated 


resulting  from  too  coarse  uniform  space  discretization, 
mesh  computations  in  such  situations. 


Those  results  also  illustrate  the  effectiveness  of  adaptive 


= uri."  "2"  -  • 

nietliod  for  a  scalar  problem  of  the  form 

Ut  —  Uj-r  =  /(n-2;,<). 

that  yields  a  bound  on  the  error  e(xJ)  =  n(^.0  - 

..  .  ■  IK-  nn  f  is  a  measure  of  the  time  discretization  like  the  larpst  time 

where  Q  is  a  constant  depending  principa  ly  space  discretization  like  the  largest  mesh  size  and  r 

step  ^vhile  p  is  an  integer  and  similarly /-  is  ^  dpallv  on  the  size  of  derivatives  of  the  solution  u  up  to  an 

'  rr^cr^S  ”  cc  U.  .1.  ma.mum  or  roof  mean  .oare  norms. 

^"tIk  exponential  factor  exp(C,()  is  called  the  k^OTurand  I' " 

|lu{.,t)-tH-,t)II<  +  h  )C(u), 

in  other  words  without  a  factor  that  fJ“tiXthe^ errlTali'  approximation  of  a 

iit:  orri^e^ 

‘^t'?::::;urS-uHordeter.nin^^^^ 

the  correct  time  scale?  Actually,  not  very  y,  tio„,  ,he  stability  '’‘P'f' /'  > 

the  stability  factor  grows  very  y  .  p„ximately  exp(1000().  This  effectively  rules  out 

tor  C  ~  1.  For  the  computation  shown  in  Fig.  _  h  PP  y,  „„  answering  the  question^ 

any  accuracy  after  a  very  short  ‘"'“h.'™'  '"1,®';“ answer  is  given  in  terms  of  the  truncation  or  local 
How  well  Joes  ihe  sol.ff.n  ,||  ,l,e  local  errors  add  constructively,  rather  than  with  some 

:reiUtltaTthi's%roauces  the  exponential  growth  in  the  bound. 
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4  An  Overview  of  A  Posteriori  Error  Analysis 

Developing  a  theory  of  computational  error  estimation  is  a  main  goal  of  our  research.  The  theory  is  based  on 
an  a  posteriori  error  analysis  that  answers  the  question:  ‘‘How  well  does  the  appvotimaiiov  salisfy  Ike  differeniwl 
eqvaUon?  .  The  answer  is  given  by  the  residual  7?.,  defined  roughly  speaking  as; 

72(/)  =  Ilf/, -ir„- /(toil. 

This  quantity  is  computable,  since  it  is  written  in  terms  of  derivatives  of  the  approximation.  For  the  methods  we 
use  -  +  Tit  decomposes  into  space  and  time  residual  errors  naturally.  The  residual  error  is  not  the  error 

instead  the  relation  behveen  the  two  errors  can  be  written  genericaily  as 

||u{-,t)  -  f/(-,t)||  <  5(0  max  7J(s). 

where  5(/)  is  a  stability  factor  for  the  a  posteriori  error  bound  and  the  maximum  is  over  the  space-time  domain 
up  to  time  t.  5(0  is  determined  by  the  solution  of  the  dual  problem  to  the  differential  equation: 

f-^Os)  -  = /'(u)2.  t>s>0. 

\^(0  =  e(0/k(0l. 

with  e(t)  =  u{i)  -  U{i).  This  is  a  linear  problem  integrated  backwards  in  time  from  t  to  zero  obtained  hv 

linearizing  (3)  around  the  solution.  5(/)  is  given  as 

5(0=  f  \\zr{-.s]\\ds. 

Jo 

The  dual  problem  arises  naturally  when  considering  the  effect  of  perturbations  on  the  solution  of  a  differential 
equation  once  It  has  been  written  in  variational  form,  just  by  integrating  by  parts.  Of  course,  the  variational 
form  of  the  differential  equation  is  the  starting  point  for  a  Galerkin  finite  element  method  as  well 

There  IS  a  close  parallel  between  the.se  error  bounds  and  the  classic  error  bounds  on  the  numerical  solution  of  a 
sjstem  of  hnear  equations  Ax  =  b,  where  A  is  a  matrix  and  3-  and  h  are  vectors.  The  direct  approach  -  e.stimating 
the  error  |A  ~  x\\  -  is  of  course  not  po.ssible.  Instead,  the  residual  error  ||.4.V  -  6||.  which  is  computable  is 
introduced.  The  relationship  between  the  error  and  the  residual  is  determined  bv  the  condition  number  of  the 
matrix:  the  error  is  bounded  by  condition  number  times  residual.  The  stability  factor  plavs  exactly  this  role  for 
differential  equations;  it  is  the  “condition  number"  of  the  solution  of  the  differential  equation  that  we  approximate 
F  ie  stability  factor  is  not  explicitly  computable,  because  (5)  involves  information  about  the  solution  in  the 
initial  data  and  in  the  linearized  coefficient.  However,  the  dependence  appears  to  be  much  less  critical  than  in 
the  a  prion  error  bound  where  detailed  information  about  the  solution  and  its  derivatives  is  involved  In  the 
codes,  we  compute  approximate  stability  factors  using  the  numerical  approximation.  The  complete  mathematical 
justification  of  this  step  is  not  finished.  It  can  be  shown  that  a  good  value  of  5(0  can  be  obtained  bv  choosing  a 
small  number  of  random  initial  values  for  (5).  while  experimentally  it  appears  that  on  many  problems!  trajectories 
near  w,  including  the  approximation,  give  approximately  the  same  value  for  5(0-  In  other  words,  the  sensitivity 
of  all  solutions  near  u  to  perturbations  is  roughly  the  same.  This  kind  of  results  are  also  used  in  the  definition 
of  Lyapunov  characteristic  numbers  for  a  time  dependent  differentia!  equation.  In  [10],  we  report  on  a  set  of 
experiments  testing  the  effect  of  using  different  initial  data  and  linearizing  around  different  functions  on  the 
computed  stability  factor  S(t)  for  the  bistable  equation.  While  there  are  variations  in  the  values  of  5(t)  for  the 
different  computations,  in  every  case,  the  5(f)  measures  the  same  sensitivity  of  v  to  numerical  error  during  the 
critical  transient  periods.  ° 

We  plot  the  approximation  to  the  stability  factor  for  the  computation  on  the  bistable  equation  shown  in 
tig.  1  next  to  the  approximation.  5(f)  clearly  reflects  the  alternation  of  metastable  periods,  where  5(f)  is  of 
order  1  and  slowly  increasing,  with  the  transient  periods,  which  are  marked  by  the  very  rapid  increase  in  5. 
After  the  transient,  the  stability  factor  drops  precipitously,  indicating  that  the  subsequent  solution  is  essentially 
independent  of  accumulation  of  previous  errors.  When  the  solution  finally  converges  to  the  uniform  equilibrium 
state,  the  stability  factor  is  one,  and  all  previous  error  due  to  accumulation  is  removed.  Contrary  to  the  conclu.sions 
suggested  by  the  classic  a  priori  error  analysis,  the  bistable  equation  is  in  fact  computable. 
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5  Details  in  the  Case  of  the  Bistable  Equation 

§5.1  The  discretization 

w,  use  the  ^  ~  (jV 

to  then  thmUxva]  =  ^  1  r  _, 

of  the  domain  Si  =  [0, 1]  denoted  %  =  =  0  <  x„,,  <  ■  ■  ■  <  *»  «-  =it'  f  ,"'to  £  « 

denote  the  fth  mesh  length  and  /)„  denote  the  mesh  function  with  hn{x)  -  /in,,  for  a:  m  ^nn,;=ts 

Jrlodate  the  space  of  continuous,  piecewise  linear  functions  V,  in  x  The  approx, matron  space  ®(,)  cons,sts 
of  functions  that  are  piecewise  polynomials  in  time  with  coefficients  in  In  on 

=  KG-eVn. 

i=o 

Because  f  €  V{,)  is  generally  discontinuous  across  a  time  node,  we  use  the  notation  V;-  ‘*,T''The 

value  of  1-'  at  („  from  below  and  above  respectively,  and  ll  ).  to  denote  the  change  ,n  value  V  +  -  f„  .  The 
discontinuous  Galerkin  method  reads:  Compute  U  mV  such  that  for  n  -  1,2,  - 

jf  ,£;„r)d(  + (TV, Cr)d(  +  (!'„t..>'t-.)  =  (f'.r-, m 

for  all  polynomials  r  in  t  of  degree  ,  or  less  with  roefflcients  in  K.,  where  (».u)  =  cr*.  In  the  rase  ,  =  0 
for  example,  this  yields:  compute  U„  in  V„  satisfying 

{i:,7,r)  +  {(U£)..v,)k„  -  {f{U,7),v)k„  =  {U,7.:,v)  for  all  r  in  V;,. 

This  is  simply  the  well-known  implicit  Euler  approximation. 

§5.2  The  a  posteriori  error  bound 
With  II  •  II  denoting  the  root  mean  square  norm  on  Q  and 

|]r||/  =  max|lr(-,/)||: 

fS/r, 

the  a  posteriori  error  bound  for  the  piecewise  constant  approximation  q  =  0  reads 

|ie,7ll  <  5(^n)max7^.(^^m), 


7^(^, m)  =  Ciii/r?„/(r)ik  +  C2||/.r„D2(r )ii/„  -f 

+  C,\\kMU)\U^  +C5l|[fnn.|i  +  C'6ll^[T']n!r.  (8) 

for  some  constants  Q  ~  1.  where  A =  f^r|[r. 

£>2(t')!tx,_.,r,]  =  ■^max{|C/rl[r„x._,]  “  t^r|[r.-i,x.-i)|.  lt^xl[x.,x.+i]  “  t^x|[r,_..x.ll } 

xi-  lx.  ‘Wr,nd”  derivative  of  U  and  the  starred  term  is  present  only  if  the  mesh  is  changed  across  t„-i. 
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§5.3  Approximation  of  the  stability  factors 

We  compute  approximate  stability  factors  5p(/n,V')  by  applying  the  finite  element  method  to  tlie  linear 
problem  (5)  obtained  by  linearizing  around  the  approximation  U  and  using  a  guess  for  the  initial  condition 
f(0/l«(OI-  appropriate  order  quadrature  formula  to  compute  S(t)  from  this.  Since  (5)  is  linear  and 

moreover  since  we  are  not  interested  in  obtaining  a  very  accurate  approximation  of  the  solution  of  (.5),  but  only 
in  obtaining  S{t)  within  in  an  order  of  magnitude,  this  backward  integration  is  generally  much  cheaper  than  the 
forward  integration. 


§5.4  Adaptive  error  control 

The  algorithm  for  adaptive  error  control  is  essentially  an  iterative  method  for  computing  the  solution  of  a 
constrained  minimization  problem:  Given  a  tolerance  TOL,  compute  an  approximation  U  satisfying  ||u(-,t)  - 
^'"(■>011  <  TOL  ,  for  f  >  0  using  as  little  computational  work  as  possible.  TOL  is  the  global  error  tolerance.  We 
replace  the  ideal  goal  of  this  problem  by  the  practical  goal  of  computing  U  satisfying 

5(tn)maxi|7^([^^n)l|  <TOL  for  n  >  0,  (9) 

while  using  as  little  work  as  possible.  We  begin  with  an  initial  coarse  discretization,  compute  the  approximation, 
compute  an  approximation  to  5(/).  then  refine  or  coarsen  the  mesh  as  appropriate  in  order  to  achieve  (9).  The 
actual  refinement  decisions  are  based  on  the  sizes  of  the  individual  terms  in  11.  In  practice,  the  error  control 
generally  achieves  (9)  with  a  couple  of  iterations. 

§5.5  The  linear  solver 

The  discrete  linear  equations  that  are  solved  at  each  time  step  to  get  U  are  in  general  non-symmetric.  non- 
positive-definite.  and  not  diagonally  dominant.  To  solve  such  systems  efficiently,  we  u.se  the  QMR  (Quasi-Minimal 
Residual)  method.  QMR  is  a  Krylov-subspace  technique  which  requires  only  the  operations  of  multiplication  by 
the  matrix  and  its  transpose,  and  a  scalar  product.  This  allows  the  code  to  be  con.struct.ed  using  a  matrix-free 
approach  in  which  the  matrix  for  the  discrete  systems  are  never  assembled,  making  both  the  memory  management 
and  efficient  implementation  on  parallel  computers  easier.  See  [10]. 

§5.6  Numerical  example 

We  show  the  details  of  the  computation  presented  in  Fig.  1  using  CARDS.  In  Fig.  2(a).  we  plot  the  finite 
element  approximation  V  at  times  t  =  11.11  and  t  =  55-56.  The  second  time  is  during  the  metastabie  period  after 
the  first  well  has  collapsed.  We  can  see  that  there  has  hardly  been  any  motion  in  the  second  well.  In  Fig.  2(b), 
we  plot  the  mesh  function  fi(r)  versus  t  for  the  same  two  times.  The  computation  was  started  with  a  uniform 
mesh  with  40  elements,  and  we  can  see  how  the  error  control  refined  the  mesh  to  accurately  represent  the  wells 
from  the  beginning.  We  can  also  see  the  mesh  has  been  coarsened  after  the  first  well  collapses.  A  convenient 
measure  of  the  “amount  of  collapse"  at  time  t  is  ||1  -  (•,  /)t|.  We  plot  this  function  in  Fig.  3(a).  In  Fig.  3(b),  we 
plot  the  number  of  elements  versus  lime  where  we  can  clearly  see  the  effects  of  coarsening  after  each  transient.  In 
Fig.  3(c),  we  plot  the  time  steps  versus  time.  Again,  the  transient  regions  are  marked  by  a  substantial  decrease 
in  the  time  step. 

6  Conclusion 

Reaction-diffusion  problems  present  an  interesting  challenge  to  engineers  and  mathematicians.  The  nonlinear 
nature  of  the  problems  and  the  generic  tendency  towards  multiple  scales  in  the  space-time  behavior  of  solutions 
makes  the  precise  mathematical  analysis  difficult  and  numerical  analysis  is  one  of  the  main  tools  for  exploration 
of  the  solutions.  For  this  reason,  it  is  important  to  provide  the  means  to  estimate  the  error  of  numerical  results. 
Accurate  estimates  are  difficult  to  obtain  a  priori,  so  we  pursue  the  development  of  an  a  posterior  error  analysis 
that  measures  the  error  in  terms  of  computable  quantities  that  require  only  the  approximation.  This  analysis 
measures  the  local  production  of  error  in  terms  of  the  residual  error,  i.e.  the  remainder  left  from  substituting 
the  approximation  into  the  differential  equation,  and  measures  the  accumulation  of  errors  by  a  computational 
stability  factor.  We  described  this  theory  as  applied  to  the  bistable  equation,  which  is  a  simple  reaction-diffusion 
equation  that  has  solutions  with  multiple  scales  in  their  space  and  time  behavior. 
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Figure  2:  (a)  The  finite  element  approximation  (Ut  Um«  '  =  ^ fll^rnte.Tvelsue The 

Terr^rr'rrtirreThT  »  ..e  region  arounfi  the  .e„  on  the  let. 

after  it  has  coJlapsed. 


(a) 


tb) 


rignre  3:  (a,  A  plot  of  ||1  -  f — htglTrru::^:^  (c. 
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Error  estimates  for  subgridded  FDTD  schemes 

Peter  Monk  (monkOmath. udel.edu) 

Department  of  Mathematical  Sciences 
University  of  Delaware 
Newark,  DE  19716 

Local  meshing  or  sub-gridding  has  been  advocated  by  a  number  of  authors  as  a  way  of 
increasing  the  spatial  resolution  of  the  finite  difference  time  domain  method  (FDTD),  see  for 
example  [1,2, 3, 4],  The  concept  is  also  used  in  other  areas,  for  example  in  wave  propagation 
[5,6],  and  some  analysis  has  already  been  performed  [5,6,7]. 

In  this  talk  (which  reports  on  work  in  to  appear  in  Journal  of  the  ACES),  we  start  by 
giving  a  standard  analysis  of  the  spurious  reflection  that  occurs  at  an  interface  between  two 
grids  for  the  usual  FDTD  scheme.  This  serves  as  a  baseline  to  compare  our  analysis  of 
subgridding  schemes.  We  next  show  how  to  use  supraconvergence  techniques  (see  [8])  to 
analyze  the  error  in  a  simple  sub-gridding  strategj^  in  two  dimensions.  This  strategy  is  based 
on  constant  interpolation  (or  one  sided  integration  if  the  method  is  viewed  as  a  finite  volume 
scheme).  Surprisingly  the  method  is  found  to  converge  at  least  at  order  3/2.  Experimentally, 
the  rate  is  second  order  which  is  the  same  as  for  the  standard  Yee  scheme. 

To  give  a  more  detailed  picture  of  the  error  induced  by  the  step  change  in  the  grid,  we 
also  analyze  the  spurious  reflection  that  occurs  at  an  interface  between  two  grids  of  different 
size  for  the  simple  subgridding  method  and  for  another  subgridding  scheme  employing  linear 
interpolation  (like  [2]).  The  overall  order  of  convergence  of  the  reflection  coefficients  is  the 
same  for  all  the  methods,  but  the  linear  scheme  has  a  lower  amplitude  spurious  transmitted 
mode  compared  to  the  simple  subgridding  scheme. 

We  expect  our  analysis  to  carry  over  to  the  FDTD  discretization  of  the  full  three  dimen¬ 
sional  Maxwell  system,  and  an  effort  to  carry  out  this  analysis  is  under  way.  The  inclusion 
of  time  stepping  error  in  the  analysis  is  also  necessary  (in  this  context  see  [8]). 
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Applications  of  Numerical  Linear  Algebra  in  Electromagnetics^ 

Griffin  K.  Gothard  John  H.  Henderson  A.  Scottedward  Model 
Abstract 

Numerical  solution  of  electromagnetics  (EM)  problems  is  typically  accomplished  through  (1)  differen¬ 
tial  equation  methods  or  (2)  integral  equation  methods.  The  first  two  of  these  methods  generally  require 
the  solution  of  a  linear  system  of  n  equations  in  n  unknowns  Ax  =  6  in  order  to  obtain  an  approximate 
solution  to  the  imderlying  EM  problem.  We  first  present  a  discussion  of  these  methods,  followed  by 
a  survey  of  some  common  numerical  techniques  used  in  the  solution  of  the  linear  system  of  equations 
Ax  =  b. 

1  Introduction 

Practical  solution  methods  for  typical  problems  in  electromagnetics  (EM)  require  numerical  solution  tech¬ 
niques.  Many  of  these  numerical  schemes  require  the  solution  of  a  linear  system  of  n  equations  in  n  unknowns 

Ax  =  b  beC^  (1.1) 

in  order  to  obtain  an  approximate  solution  to  the  original  EM  problem.  We  present  first  a  survey  of  numerical 
techniques  used  in  electromagnetic  applications  to  generate  linecir  systems  of  equations,  and  second  common 
numerical  techniques  used  in  the  solution  of  equation  (1.1). 

2  Numerical  Methods  in  Electromagnetics 

EM  problems  are  stated  in  terms  of  Maxwell’s  equations  (partial  differential  equations)  whose  solution 
space  is  inherently  infinite  dimensional.  In  order  to  permit  numerical  solution,  it  is  necessary  to  reduce  the 
problem  to  one  of  finite  dimension  through  direct  discretization  of  the  solution  space  or  by  discretizing  the 
partial  differential  operators  in  Maxwell’s  equations.  This  procedure  is  implemented  as  either  (1)  differential 
equation  (DE)  methods,  or  (2)  integral  equation  (IE)  methods 

Several  popular  algorithms  are  available  to  generate  solutions  to  EM  problems.  Except  for  some  of  the 
high  frequency  techniques,  they  all  solve  either  differential  or  integral  equations  in  one  form  or  another. 
Since  only  differential  or  integral  equations  are  involved  in  the  final  solution,  the  methods  described  here 
should  be  of  considerable  interest  to  other  areas  of  science.  The  underlying  constants  describing  the  physics 
of  the  system  may  differ,  but  the  mathematical  equations  are  similar.  In  fact,  other  branches  of  engineering, 
such  as  acoustics,  heat  transfer,  and  fluid  dynamics  have  developed  similar  methods  [1]. 

2.1  Differential  Equation  Methods 

In  the  differential  equation  solution  approach,  we  formulate  a  set  of  differential  equations  to  describe  the 
scattering  mechanism.  We  divide  the  problem  space  into  regular  or  prismatic  zones  and  apply  discretized 
forms  of  the  differential  equations  at  each  zone  to  generate  a  linear  system  of  equations.  An  approximate 
solution  to  the  original  EM  problem  is  obtained  through  application  of  the  appropriate  boundary  conditions 
followed  by  the  solution  of  the  corresponding  equation  (1.1). 

In  open-region  DE  type  problems,  a  further  reduction  in  problem  dimension  is  required,  since  the  solution 
space  is  unbounded.  The  problem  space  must  be  artificially  terminated  using  some  type  of  truncation 
condition.  Since  the  fields  generally  are  not  zero  beyond  the  truncated  boundary,  we  must  introduce  a  set  of 
artificial  boundary  conditions,  such  as  absorbing  boundary  conditions  (ABC),  boundary  integral  formulation 
(BIF),  or  the  measure  equation  of  invariance  (MEI).  This  termination  of  the  space  grid  at  a  finite  distance 
introduces  further  inaccuracies  in  the  problem  solution. 

Three  examples  of  DE  methods  are  the  finite  difference  (FD)  method,  the  finite  element  (FE)  method, 
and,  when  using  the  integral  form  of  Maxwell’s  equations,  the  finite  integral  technique  (FIT). 

^The  authors  are  with  the  department  of  Electrical  Engineering,  Auburn  University,  Alabama,  36849. 
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Finite  differences  The  finite  difference  method  was  first  developed  by  A.  Thom  [2]  in  the  1920s  to  solve 
nonlinear  hydrodynamic  equations.  Since  then,  the  FD  method  has  found  applications  in  many  different 
fields  for  a  wide  range  of  problems,  including  computational  electromagnetics.  The  method  is  based  upon 
approximations  of  the  partial  differential  equations  (PDE)  with  finite  difference  ^nations. 

In  the  FD  method,  the  finite  difference  equations  relate  a  given  unknown  with  neighboring  unknowns. 
This  is  accomplished  by  dividing  the  problem  space  into  discrete  points,  or  nodes,  and  allowing  each  node  to 
represent  an  unknown  field  quantity.  Each  unknown  field  quantity  is  surrounded  by  neighboring  unknowns, 
or  known  boundary  conditions  which  define  individual  problems  within  a  given  class  of  problems.  Then,  the 
particular  PDE  is  estimated  numerically  with  standard  finite  difference  equations,  e.g.. 


. .  f  (do  +  Aa)  -  f{ao  -  Aa) 

/  («o)  -  (2Aa) 

(2.1) 

and 

. f(ao  +  Aa)-2/(ao)+/(ao-Aa) 

/  -  (Aa)2 

(2.2) 

Equations  (2.11  and  (2.2)  are  readily  derived  through  a  truncated  Taylor  series  expansion;  see,  e.g.,  [3]. 

In  frequency  domain  computational  electromagnetics,  equation  (2.2)  is  used  to  approximate  the  second 
spatial  derivative  in  Laplace’s  or  Poisson’s  equation.  This  results  in  the  numerical  analysis  tool  called  the 
finite  difference  (FD)  method.  Since  equation  (2.2)  is  a  discrete  approximation  to  the  second  derivative, 
problems  with  no  easily  obtained  analytical  solution  can  be  broken  down  and  analyzed  on  a  discrete  grid. 
Thus  an  approximate  solution  can  be  found,  which  may  closely  resemble  the  unique  solution. 

Finite  elements  The  finite  element  method  was  initially  developed  by  Courant  [4]  in  1943,  and  in  the 
1950’s  began  to  see  use  in  aircraft  design.  However,  the  method  was  not  applied  to  electromagnetics  until 
1968.  A  discussion  of  numerical  issues  in  the  finite  element  method  is  presented  in  [5]. 

The  finite  element  method  divides  the  domain  of  the  problem  space  into  subdomains,  or  elements.  (For 
the  two-dimensional  case  these  elements  are  generally  rectangular  or  triangular  in  shape;  three  dimensional 
problems  typically  use  tetrahedrons  or  3-D  rectangular  grids.)  An  interpolation,  or  basis,  function  is  defined 
on  each  element  to  provide  an  approximation  to  the  unknown  solution  within  each  subdomain.  This  mter- 
polation  function  is  usually  defined  as  a  first  or  higher  order  polynomial.  Third,  a  system  of  equations  must 
be  assembled,  and  the  Rayleigh-Ritz  variational  or  Galerkin’s  methods  may  be  used  for  this  step.  Finally, 
the  resulting  system  of  sparse  equations  must  be  solved.  . 

As  a  simple  example,  assuming  we  have  discretized  a  static  two-dimensional  problem  domain  utihzmg 
triangular  elements,  or  subdomains,  the  interpolation  function  may  be  defined  as  a  first  order  polynomial 
over  each  subdomain  as  o\ 

Vn(a!,y)  =  A„x +  Cn  (2-3) 

where  A„,  Bn  and  Cn  are  constants,  and  Vn(s,  y)  is  the  unknown  potential  at  the  n**  triangular  subdomain. 
Next  we  must  generate  the  system  of  equations,  and  for  this  we  will  use  Galerkin’s  method. 

Galerkin’s  method  is  a  weighted  residual  method  that  attempts  to  reduce  the  residual  to  the  least  value 
at  all  points  in  the  problem  region.  Also,  the  weighting  function  is  chosen  to  be  the  same  as  the  interpolation 
function,  which  in  our  case  is  defined  in  equation  (2.3).  Following  this  procedure,  ^d  implementing  the 
appropriate  boundary  conditions,  a  system  of  n  sparse  equations  with  n  unknowns  will  be  obtamed,  which 
then  may  be  solved  to  determine  the  FE  method  generated  solution. 

Finite  integrals  The  finite  integral  technique  was  originally  applied  by  Rienen  [6]  in  1985  and  Lebaric  [7] 
in  1989.  This  method  directly  applies  the  Maxwell’s  equations  in  integral  form  to  a  discrete  grid. 

As  in  the  FE  method,  the  FIT  first  divides  the  domain  of  the  problem  space  into  subdomains,  or  elements. 
The  FIT  applies  the  integral  form  of  Maxwell’s  equations  integral  form  to  each  element,  utilizing  basis 
functions  that  explicitly  define  the  field  quantities.  The  result  is  a  single  equation  that  may  be  used  to  relate 
each  unknown  quantity  in  the  subdomain  to  the  neighboring  unknowns.  The  resulting  system  of  equations 
is  assembled  by  applying  the  equation  obtained  for  each  element  to  each  unknown  in  the  problem  domain. 
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This  procedure,  along  with  the  incorporation  of  the  appropriate  boundary  conditions,  is  used  to  generate  a 
system  of  n  equations  with  n  unknowns  (1.1),  which  is  solved  to  obtain  the  FIT  problem  solution. 

A  matrix  structure  The  matrix  A  in  linear  systems  of  equations  (1.1)  that  arise  from  FD,  FE,  and 
FIT  approaches  have  the  same  properties  (and,  in  fact,  may  be  identical  for  certain  geometries).  They 
are  very  sparse,  with  each  row  of  the  matrix  relating  each  node  or  edge  only  to  its  immediate  neighbors. 
Further,  the  off  diagonal  entries  0£j,  i  ^  j  on  each  row  typically  satisfy  |ai£|  >  |ay(,  i  ^  j;  that  is,  the 
magnitude  of  the  diagonal  entries  is  larger  than  that  of  the  off-diagonal  entries  on  the  same  row.  We  call  this 
property  diagonally  strong-,  this  is  a  weaker  condition  than  diagonal  dominance,  but  is  a  useful  property  for 
some  iterative  solution  methods  for  equation  (1.1).  The  matrices  usually  exhibit  a  banded  nature,  and,  for 
comparable  problems,  there  are  more  unknowns  thm  for  the  integral  equations  solutions  since  the  differential 
equation  techniques  require  discretization  of  the  space  surrounding  the  object  of  interest. 

A  key  observation  in  open-region  problems  is  that,  since  some  kind  of  mesh-truncation  scheme  is  required 
for  solution  by  differential  equation  methods,  these  truncation  techniques  often  destroy  some  desirable  char¬ 
acteristic  of  the  matrix  A  in  equation  (1.1).  For  example,  boundary  integral  methods  destroy  sparsity, 
absorbing  boundary  conditions  (ABC)  and  perfectly  matched  layer  (PML)  techniques  require  a  large  num¬ 
ber  of  unknowns  since,  for  good  results,  the  truncated  boundary  must  be  removed  far  from  the  scatterer. 
Our  experience  with  the  measured  equation  of  invariance  (MEI)  technique  applied  to  arbitrarily  shaped 
three-dimensional  objects  indicates  that  it  also  destroys  properties  of  the  matrix  A  such  that  solution  by 
iterative  methods  becomes  much  more  difficult. 

2.2  Integral  Equation  Methods 

In  the  integral  equation  solution  approach,  we  derive  integral  or  integro-differential  equations  which  describe 
the  problem.  The  most  popular  IE  method  is  the  method  of  moments  (MoM)  introduced  by  Harrington  in 
1967  [8]. 

In  the  method  of  moments,  unlike  in  the  DE  techniques,  only  the  body  of  the  object  in  question  is 
discretized.  For  a  perfectly  electrically  conducting,  or  homogeneous  penetrable,  scatterer,  only  the  surface  is 
discretized,  usually  into  rectangular  or  triangular  patches  [9,10].  An  inhomogeneous  penetrable  body  requires 
the  discretization  of  its  volume  into  rectangular  prisms  or  tetraihedrons  [11].  The  currents,  either  actual 
or  an  equivalent,  are  expressed  over  the  discretized  region  as  a  linear  combination  of  subdomain  basis,  or 
expansion,  functions  such  as  pulse  or  triangular  functions.  For  example,  the  currents  on  an  arbitrarily-shaped 

PEC  scatterer  whose  surface  is  discretized  into  triangular  patches  may  be  expressed  as  J(r)  = 

i=l 

where  n  is  the  number  of  edges  of  the  discretization  of  the  surface  that  are  bordered  by  two  surface  triangles, 
Xi  is  the  weight  of  each  basis  function,  and  /^(r)  is  the  basis  function  at  edge  i  given  by 

(  ZA^  Pi+ )  ^  ^  ^t+ 

=  reTi- 

[  0,  otherwise 

where  U  is  the  length  of  edge  i,  Ti+  and  Ti-  are  the  two  triangles  bordering  edge  i,  A£+  and  A,-  are  the 
areas  of  the  two  triangles  and  Pi+  and  Pi-  are  vectors  pointing  from  the  vertex  of  each  triangle  opposite 
edge  i  to  the  point  r  [12].  The  unknowns  are  Xi,  the  magnitude  of  the  current  crossing  each  edge.  We  thus 
require  a  method  of  constructing  an  nxn  impedance  matrix  A  and  a  right-hand-side  data  vector  6  in  (1.1). 
We  may  then  use  a  linear  system  solver  to  solve  for  the  unknown  vector  x. 

The  method  of  moments  (MoM)  expresses  the  problem  to  be  solved  in  the  form 

Lif)  =  g  (2.4) 

where  L  represents  an  integro-differential  operator  applied  to  function  /,  and  p  is  a  forcing  function.  It 
is  required  to  define  an  inner  product  <  f,g  >  and  choose  a  set  of  testing  functions,  =  l--.n-  If 
the  expansion  functions  are  subdomain  functions,  then  typically  the  testing  functions  are  also  subdomain 
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functions,  and  may  be  identical  to  the  basis  functions.  Note  that  a  set  of  entire  domain  basis  and/or 
testing  functions  may  be  applied  in  MoM.  However,  when  solving  three-dimensional  problems,  defining  such 

functions  becomes  difficult.  •  a 

We  substitute  the  linear  combination  of  expansion  fiinctions  into  equation  (2.4)  and 

=  9  (2.5) 

t=i 

results.  We  take  the  inner  product  of  both  sides  of  equation  (2.5)  with  each  weighting  function  wi,  generating 
n  equations  of  the  form  <  tn,,  L(/i)  >=<  wt,9>.  It  Mows  that  the  entries  of  the  matrix  A  and  data 

vector  6  satisfy  aji  =<  >,  and  6i  =<  >,  1  <  <  «•  „  ,  ,  r  . 

For  our  example,  we  derive  an  integro-differential  equation  to  find  the  operator  L  by  enforcing  the 
boundary  condition  ft  x  (.Ej  +  JE?,)  =  0  on  the  surface  of  the  scatterer,  resulting  in 

{—j(jA  —  V$)<on  =  —Ei,ian 


and  a  may  be  determined  from 


V  •  J  =  ~ju}(T 


(2.6) 

(2.7) 


where  S  is  the  surface  of  the  scatterer  and  R  is  the  distance  from  the  point  of  integration  to  the  observation 


point  r  [10].  u 

When  the  testing  is  applied,  we  obtain  an  =<  wi,Ai  >  +  <  >  and  6/  =<  wi,Ei  >  where  Ai 

and  are  determined  from  equations  (2.6)  and  (2.7)  by  substituting  the  basis  function  / ^  for  J. 

In  the  method  of  moments,  one  may  view  the  resulting  matrix  equation  as  relating  the  current  at  eve^ 
basis  function  to  the  currents  due  to  every  other  basis  function.  Although  a  comparable  problem  solved  by 
method  of  moments  typically  would  have  fewer  unknowns  than  a  differential  equation  solution,  one  of  the 
main  disadvantages  of  the  technique  is  that  it  results  in  a  full  matrix. 

The  system  matrix  from  MoM  has  other  properties  that  may  be  exploited.  For  one,  most  MoM  formu¬ 
lations  result  in  a  matrix  that  is  diagonally  strong  (see  the  previous  subsection).  This  generally  results  in 
a  numerically  stable  solution  x  to  equation  (1.1).  K  the  testing  functions  are  selected  to  be  identical  to  the 
expansion  functions  {wi  =  fi),  the  the  resulting  complex  matrix  A  in  equation  (1.1)  will  be  symmetric  (but 
not  hermitian).  Under  some  very  simple  special  cases,  other  matrix  properties  may  result,  such  as  being 
Toeplitz.  However,  these  typically  are  not  evident  in  most  problems  of  a  practical  nature. 


3  Numerical  Solution  Techniques 

Numerical  methods  for  the  solution  of  linear  systems  of  equations  (1.1)  that  arise  in  EM  problems  continue 
to  be  an  active  area  of  research.  Solution  methods  for  both  classes  of  problems  may  be  loosely  classified  as  (1) 
Direct  methods,  e.g.,  the  LU  decomposition,  where  the  inverse  operator  of  the  matrix  A  is  applied  (to  within 
machine  precision)  to  the  data  vector  b  in  order  to  obtain  the  solution  x  of  equation  (1.1),  (2)  General  %tera.Uve 
methods,  e.g.,  Gauss-Seidel  iteration  [13],  GMRES  [14],  QMR  [15],  or  (bi-)conjugate  gradients  [16],  [17]  [IS], 
which  require  no  knowledge  of  the  spectrum  (eigenvalues)  of  the  matrix  A,  or  (3)  Spectral  iterative  methods, 
such  as  alternating  direction  imphcit  iteration  (ADI)  [19],  [20],  or  successive  over-relaxation  tetjmques,  which 
require  some  knowledge  of  the  spectrum  of  A  or  of  its  diagonal  dominance,  if  any.  All  of  these  numerica 
methods  can  be  aided  by  appropriate  preconditioning  of  the  ori^nal  problem.  Useful  references  for  additional 
reading  are  [13]  (an  essential  reference  for  anyone  doing  numerical  computations),  [21],  and  [22]. 
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3.1  Issues  in  numerical  solution  of  linear  systems  of  equations 

The  primary  issues  of  import  in  the  numerical  solution  of  equation  (1.1)  are  those  of  problem  condition, 
solution  accuracy,  or  algorithm  condition,  algorithm  reliability,  and  computational  cost  We  discuss  each  of 
these  below. 

Problem  condition  refers  to  the  change  Ai  induced  in  the  solution  x  of  equation  (1.1)  by  small  changes 
A^  and  A6  made  in  the  original  problem,  i.e.,  {A  +  Aj4)(2  +  Aa;)  =  (6  +  A&),  where  [|Ayl|j  <§:  ljyl||  and 
||A&|j  jl&li.  If  small  perturbations  Ai4  and  A6  yield  relatively  large  changes  Aa;  in  the  perturbed  solution 
vector,  then  the  problem  (1.1)  is  called  ill-conditioned.  lU-conditioned  problems  are  inherently  difficult  to 
solve  accurately  on  a  digital  computer.  The  condition  of  a  numerical  problem  is  quantitatively  measured  by 
the  condition  number  /c(.A)  =  ||j4|j  /  ||.4“^[[  of  the  coefficient  matrix  A;  a  large  condition  number  relative  to 
machine  precision  indicates  that  inaccurate  results  may  be  expected  from  any  numerical  solution  procedures. 

Algorithm  condition  refers  to  the  error  Aa:  induced  by  a  computational  procedure  that  is  used  to  solve 
equation  (1.1).  Algorithm  condition  is  bounded  below  by  the  problem  condition;  a  poorly  designed  algorithm 
(e.g.,  Gaussian  elimination  without  partial  pivoting)  may  have  an  algorithm  condition  that  is  arbitrarily 
worse  than  the  problem  condition  of  equation  (1.1).  a;(€)  to  the  desired  nominal  problem.  That  is,  Gaussian 
elimination 

Algorithm  reliability  refers  to  the  ability  of  an  algorithm  to  detect  ill-conditioned  problems  or  potentially 
inaccurate  solutions.  A  condition  estimator  such  as  that  proposed  in  [23]  provides  a  convenient  heuristic 
to  improve  the  reliability  of  an  LU-decomposition  based  solver,  although  it  can  greatly  underestimate  the 
condition  number  of  a  matrix  [24]. 

The  computational  cost  of  an  algorithm  refers  the  the  required  computational  resources,  typically  mea¬ 
sured  in  terms  of  storage  requirements  or  floating-point  operations  (flops).  A  well  designed  iterative  sparse 
matrix  solver  may  require  relatively  little  storage  and  relatively  few  (say,  0(n))  flops  per  iteration  by  taking 
advantage  of  available  structure  in  the  matrix  A.  Conversely,  by  ignoring  matrix  structure,  an  otherwise 
useful  dense  matrix  solution  algorithm  will  require  O(n^)  storage  and  0(n®)  flops  for  solution  of  equation 

(1.1) .  Since  the  dimension  n  of  a  typical  EM  problem  is  roughly  of  the  magnitude  of  10^-10'^,  it  is  clearly  in 
the  best  interest  of  the  investigator  to  carefully  exploit  what  structure  is  available  in  a  given  problem. 

3.2  Direct  methods 

Direct  methods  (LU  decomposition,  QR  decomposition)  are  typically  applied  in  cases  where  the  coefficient 
matrix  A  is  large  and  dense,  since  these  techniques  typically  do  not  respect  sparsity  or  other  available 
structure.  Most  methods  make  use  of  the  LU  decomposition,  since  this  allows  the  algorithm  to  make  use  of 
banded  structure  that  may  be  available  in  a  given  problem.  Due  to  the  high  dimension  n  of  problems  (1.1) 
involved  in  EM,  multiprocessor  (parallel)  supercomputers  may  be  used.  An  early  survey  of  parallel  solution 
methods  is  in  [25];  see  also  [26],  [27],  [28].  The  arcMtecture  of  the  machine  used  (SIMD  vs.  MIMD,  global 
memory  vs.  local  memory)  have  a  profound  impact  on  the  efficacy  of  a  given  algorithm  choice.  Numerical 
software  packages  such  as  LAPACK  [29],  the  successor  to  LINPACK  and  EISPACK,  have  been  optimized 
for  some  specific  architectures  and  are  available  on  many  supercomputers. 

3.3  General  iterative  methods 

When  the  matrix  A  in  equation  (1.1)  is  sparse,  computational  costs  render  the  application  of  a  direct  solver 
impractical.  Iterative  solution  procedures  compute  a  sequence  of  solutions  xo,xi,...  that,  in  the  absence 
of  round-off,  converge  to  the  exact  solution  x  of  equation  (1.1).  Iterative  solution  methods  are  typically 
based  on  either  a  recurrence,  e.g.,  conjugate  gradients  (CG)  [17],  generalized  conjugate  gradients  [16],  or 
biconjugate  gradients  (BCG)  [18],  or  on  projection  of  the  problem  (1.1)  into  a  reduced  dimensional  subspace, 
e.g.,  GMRES  [14]  or  QMR  [15].  A  common  factor  in  each  of  these  methods  is  that  they  only  require  that  it 
is  possible  to  efficiently  form  the  matrix-vector  product  y  =  Av  for  arbitrary  vectors  u  €  C".  In  particular, 
these  methods  do  not  require  knowledge  of  the  spectrum  (eigenvalues)  of  the  matrix  A. 

The  CG  method  [17]  is  an  effective  method  for  the  solution  of  equation  (1.1)  when  the  matrix  A  is 
hermitian  positive  definite.  In  the  absence  of  numerical  roundoff,  the  CG  method  is  known  to  converge  to 
the  solution  x  of  equation  (1.1)  in  at  most  n  iterations.  If  A  is  not  hermitian,  one  may  premultiply  equation 

(1.1)  by  the  complex  conjugate  transpose  A^  of  the  matrix  A  to  obtain  the  normal  equations  A^Ax  =  A^b, 
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that  may  then  be  solved  by  the  CG  method.  Such  an  approach  is  inadvisable:  (1)  due  to  numerical  roundoff, 
convergence  to  the  solution  x  of  equation  (1,1)  rarely  occurs  within  n  iterations.  Second  the  two-norm 
induced  condition  K2{A^ A)  =  K2{A)\  and  so  solution  via  the  normal  equations  is  inherently  nui^ncally 
less  stable  than  solution  via  equation  (1.1).  Thus,  the  CG  method  has  typically  come  to  be  regarded  as  an 
iterative  method. 

3.4  Spectral  iterative  methods  , 

Spectral  iterative  methods,  like  general  iterative  methods,  compute  a  sequence  of  solutions  xo,xu...  that 
converge  to  the  desired  solution  *  of  equation  (I.l).  However,  spiral  iterative  ^thoj 
knowledge  of  the  eigenvalues  of  the  matrix  A  or  of  the  splitting  A  =  M  +  N  used  in  the  spectral  method.  A 

survey  of  spectral  iterative  methods  is  in  [21],  [22].  ,,  ,  ,  ... 

The  principles  of  spectral  iterative  methods  may  be  illustrated  in  the  Jacobi  method  (or  the  related 
Gauss-Seidel  method).  Given  a  splitting  A  =  M+N  we  may  write  Mx  =  -Nx  +  b,  which  naturally  suggests 
the  recursion 

Mxk+i  =  -Nxk  +  b  vJ-1) 

that  forms  the  basis  of  the  Gauss-Seidel  iteration.  The  sequence  Xk  generated  by  equation  (3.1)  converges  to 
the  solution  ®  of  (1.1)  if  and  only  if  the  eigenvalues  Al, ...,  A„  of  the  matrix  he  inside 

speed  of  convergence  is  Umited  by  the  spectral  radius  p(M-^N)  =  max.  1A,|.  The  Successive  Over  Retotion 
(SOR)  method  employs  a  free  parameter  a;  in  a  modified  splitting  =  D  +  wL, 

Lch  that  the  spectral  radius  <  piM-^N).  The  alternating  direction  imphcit  (ADI)  iteration 

[19],  [20],  [30]  requires  a  spUtting  A~  M  +  N  such  that  MN  ~  NM  {M  and  N  commute)  and  that  th 
eigenvalues  of  the  matrix  A  be  in  a  known  region  in  the  closed  left  half  of  the  complex  plane.  In  this  case, 
a  sequence  of  parameters  pu  ...,Pk  are  computed  to  provide  a  solution  x*  that  is  within  an  a  pnon  known 
bound  from  the  exact  solution  x  of  (1.1). 

Other  recurrences  are  discussed  in  [31]. 

The  numeriSd  performance^of  linear  system  solvers  may  be  greatly  improved  by  preconditioning.  Numerical 
preconditioning  takes  two  forms:  (1)  balancing,  or  scaling,  in  which  a  diagonal  transformation  is  applied  to 
equation  (1.1)  in  order  to  improve  problem  condition  without  introducing  further  roundoflt  error,  and  (2) 
preconditioning  for  iterative  methods,  which  is  used  to  accelerate  convergence  of  iterative  algorithms. 

Balancing  When  the  condition  number  «(A)  of  the  matrix  A  in  equation  (1.1)  is  large,  then  it  is  often 
useful  to  perform  some  form  of  scaling  of  equation  (1.1)  in  order  to  compute  an  equivdent  problena  Ax  -  6 
in  which  K(i)  <  k(A)  and  from  which  the  solution  x  to  equation  (1.1)  may  be  readily  obtamed  from  the 
vector  X  For  example,  Parlett  and  Reinsch  [32]  present  an  algorithm  that  computes  a  diagonal  similarity 
transform  D  to  obtain  ix  =  b  where  i  =  D'MD,  b  =  Z)->6  and  D  is  selected  to  minimke  the  dyadic 
range  of  the  nonzero  entries  of  the  matrix  A.  Curtis  and  Reid  [33]  present  a  simile  ^gorithm  that  selects 
two  matrices  Di,  £>2  that  are  used  to  obtain  Ax  =  6  where  A  =  D1AD2,  b  -  Dib.  In  this  c^,  the  solution 
X  of  equation  (1.1)  is  obtained  as  x  =  D2~^x.  Numerical  balancing  is  appropriate  only  when  the  non^ro 
entries  of  the  matrix  A  are  known  to  roughly  the  same  relative  accuracy,  regardless  of  their  magnitudes. 

Algorithm  preconditioners  AU  iterative  methods  can  benefit  from  algorithm  preconditioning.  Algorithm 
preconditioning  differs  from  the  problem  scaling  discussed  above  §3.5  in  that  algorithm  preconditionmg  is 
applied  for  the  purpose  of  reducing  the  number  of  iterations  required  in  order  to  obtain  a  prescribed  de^ee 
of  accuracy  in  the  compute  solution  x.  Most  preconditioning  techniques  involve  a  splitting  of  the  matra 
A- M  +  N  where  the  matrbc  M  is  “easy”  to  invert  and  the  pair  M  and  AT  yield  some  desirable  properties 
for  the  given  algorithm,  e.g.,  computational  cost  relative  to  a  direct  solver,  in  addition  to  acceleratmg  the 
convergence  of  the  underlying  iterative  method. 
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4  Conclusions 

We  have  presented  a  brief  survey  of  applications  of  numerical  linear  algebra  in  the  solution  of  problems  in 
electromagnetics  (EM).  A  number  of  methods  are  available  to  reduce  the  infinite- dimensional  EM  problems 
into  finite  dimensional  linear  systems  of  equations  (1.1).  Care  must  be  exercised  to  recognize  the  structure 
and  properties  of  these  linear  systems  of  equations  prior  to  selecting  a  solution  procedure  for  numerical 
computation  of  the  solution  vector  x. 
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MULTILEVEL  PRECONDITIONING  FOR  THE 
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Abstract.  The  numerical  approximation  of  the  solution  of  the  time-harmonic  Maxwell  etjuations 
by  a  least-squares  finite  element  discretization  is  discussed  in  this  paper.  Our  emphasis  lies  in  lh<? 
efficient  solution  of  the  system  of  linear  algebraic  equations  arising  from  the  discretization.  Adcliti\-(’ 
multilevel  preconditioning  is  analyzed  theoretically  and  by  computational  experiments  for  a  simjfie 
two-dimensional  model  problem. 

1.  Introduction.  The  Maxwell  equations  for  a  time-harrnonic  electromagnetic 

field 

£{x.,t)  =  =  e-'’^‘H(x) 

in  a  domain  f>  C  are  given  bt’ 

.j  (-tu.'£  +  (j)E- VxH  =  J, 

— tu-’/rH  -p  V  X  E  =  0  . 

Here  ;  denotes  the  dielectric  constant,  p  the  permeability,  and  a  the  electric  conduc¬ 
tivity.  With  appropriate  boundary  conditions  and  under  certain  assumptions  on  f. // 
and  cr,  the  corresponding  boundary  value  problem  has  a  unique  solution  (cf.  e.g.  [7]  or 
[4,  Section  6]).  In  this  paper,  we  assume  that  is  a  bounded  polygonal  domain  and 
perfectly  conducting  boundary  conditions,  n  x  E  =  0  on  dO.. 

One  approach  to  the  numerical  .solution  of  this  problem  is  to  substitute  the  first 
equation  into  the  .second,  leading  to  the  second-order  equation 

(1-2)  V  X  (//T^V  X  E)  —  {uj^c  -b  zu-'<t)E  =  —iioJ 

for  the  electric  field  E.  The  finite  element  approximation  of  this  formulation  is  de.'^cribed 
in  [10].  If  one  is  interested  in  the  approximation  of  the  magnetic  field  H  as  well,  mixed 
finite  element  approaches  are  appropriate  (cf.  [9]).  .An  alternative  approach  which  is 
particularly  easy  to  implement  and  analyze  consists  in  the  first-order  system  least- 
squares  strategy  which  was  studied  for  second-order  elliptic  equations  in  [2].  [3].  Since 
the  Maxwell  equations  constitute  a  first-order  system,  it  is  natural  to  use  the  least- 
squares  approach  here.  Closely  related  is  the  approach  taken  in  [6]  where  the  Helmholtz 
equation  is  rewritten  as  a  first-order  system,  and  the  least-squares  finite  element  method 
is  applied. 

Our  primary  aim  in  this  paper  is  to  show  that  additive  multilevel  preconditioning 
is  feasible  for  the  least-squares  formulation  of  Maxwell’s  equations.  This  will  be  ex¬ 
plained  theoretically  in  the  following  three  sections  and  illustrated  by  computational 
experiments  in  Section  5. 
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2.  The  Least-Squares  Functional.  In  the  sequel,  we  will  need  the  space 
Vq  =  {v  e  //^(curl;  fi)  :  n  x  v  =  0  on  (90} 

{see  [5,  Theorem  2.11]  for  a.  proper  definition  of  the  trace  operator  tissociated  with 
these  boundary  values).  A  solution  of  (1.1)  subject  to  perfectly  conducting  boundaTv 
conditions  obviously  minimizes  the  least-squares  functional 

(2.1)  G'o(E,H;J)  =  |[(_ia;ff  +  (7)E  -  V  x  H  -  J||5^n  +  II  -  +  V  x  Elj^  cj 

among  all  (E,  H)  €  Vq  x  iif(curl;  0).  Vice  versa,  a  unique  solution  of  (1.1)  is  also 
characterized  by  the  unique  minimum  of  (2.1).  By  applying  the  divergence  opernlor  l  o 

(1.1) .  we  obtain  (using  the  fact  that  V  •  (V  x  v)  =  0) 

V  •  ((— znuf  +  cr)E)  =  V  •  J  and  V  ■  H  =  0 

(assuming  that  J  G  //(cliv;0)).  Since  these  two  equations  are  also  sati.sfiecl  b\-  liu- 
solution  of  (1.1),  we  may  use  them  to  augment  the  least-squares  functional  (2.1 ).  IToni 
here  on,  for  notational  convenience,  we  assume  that  (j  =  0  in  f>.  Note  that  this  is 
usually  the  most  difficult  case  since  (1.1)  may  be  singular  for  certain  values  of  (liu' 
interior  Maxwell  eigenvalues;  cf.  [7],  [10]).  The  augmented  least-squares  functional  is 
given  by 

H;  J)  =  II  —  7U/'tE  —  V  X  H  —  J||on  "b  II  ~  -b  V  x  E||q  ^ 

+  !|V-(c-E)||g,^  +  ||V.(^H)||2^^. 

(dearly,  this  augmented  least-squares  functional  requires  in  addition  to  E.  H  G 
/f(curl;fl)  that  cE,/.(H  C  if(div;fi).  Moreover,  the  boundary  conditions  n  x  E  =  0 
together  with  the  first  equation  in  (1.1)  implies  n-H  =  0  on  dO.  which  we  might  impose 
as  boundary  conditions  for  H. 

For  the  purpose  of  exposition,  let  us  further  simplif}^  the  problem  and  set  £  =  //  =  L. 
The  least-squares  functional  is  then  given  by 

G'(E,H;J)  =  ||-?:^E-VxH-J||2^  +  ||-^u;H  +  VxE||;j^ 

+  l|V-E||^,,  +  ||V.H||^^^’. 

.4doptiug  the  notation  from  [5,  Chapter  I],  we  define  the  spaces 

V  =  {Hidiv;  n)  n  Ffo(curl;fi)) ,  W  -  {Ho{dW:n)  n  //(curl;  H))  . 
and  seek  to  find  (E,  H)  G  V  x  W  such  that 

(2.3)  G(E,H;J)=  min  G(E,H;J) 

(E,H)eVx'W 

holds.  With  the  associated  bilinear  form 
.F(E,H;E,H) 

(2.4) =  (-/cjE- V  X  H,-ia;E- V  x  H)o,n -b  (-fwH  +  V  x  E,-icoH-b  V  x  E)on 
+{V-E,V-E)o,n  +  (V-H,V.H)o,n 
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(note  that  the  complex  L'\Q)  inner  product  has  to  be  used  here),  the  minimization 
problem  (2.3)  is  equivalent  to  the  variational  problem 

(2.5)  :F(E,  H;  E,  H)  =  (J,  -iwE  -  V  x  H)o,n  for  all  (E,  H)  €  V  x  W  . 

Under  certain  assumptions  on  H  (cf.  [5,  Section  3.4]),  the  above  spaces  are  conUnuously 
embedded  in  and  we  may  pose  the  minimization  problem  in  x 

(/fUU))''  suitable  boundary  conditions)  instead. 

In  order  to  obtain  straightforward  finite  element  approximation  and  multilc^•el  con¬ 
vergence  results,  boundedness  and  ellipticity  of  the  bilinear  form  (2.4)  with  rcsiieci 
to  The  H^{n)-novm  in  all  variables  is  desirable.  Unfortunately,  this  is  nol  possible  fo 
achieve  here.  In  fact,  the  ellipticity  of  the  least-squares  bilinear  form  would  be  xmy  sur¬ 
prising  since  the  bilinear  form  associated  with  (1.2)  may  become  indefinite,  in  gcucral. 
This  indefiniteness  is  due  to  the  nature  of  the  problem  winch  couples  the  achievable 
finite  element  approximation  with  the  wave  number  a,'.  After  all.  the  singular  nature  ol 
the  problem  if  a.’  is  one  of  the  interior  Maxwell  eigenvalues  has  to  show  up  somewhere 
in  this  formulation.  The  variational  formulation  via  the  least-squares  approach  ax'oids 
the  problems  caused  by  indefiniteness  (for  instance,  the  conjugate  gradient  metliud  can 
lie  used  safely)  but  it  certainly  does  not  (and  cannot)  avoid  the  problems  canscxl  In- 
loss  of  ellipticitv.  Similarly  to  the  situation  for  indefinite  problems  like  (1.2).  a  weai«'r 
form  of  ellipticity.  a  so-called  Garcling^s  inequality,  holds  and  ensures  finite  (haueiit 
approximation  results  for  sufficiently  fine  triangiilations  depending  on  .n. 

For  what  is  to  folloiv.  it  is  convenient  to  introduce  the  norm  ||  ■  lUu.n  defined  1)\- 

(2.6)  llvIU-n.n  =  (iv|?.Q  +  ^''^||v|lo.o)  ^  for  v  €  {H\9)Y  . 

where  ]  •  |nQ  denotes  the  standard  (i/'(Q))^'-seminorm.  Note  first  that  the  boundedness 
of  the  least-squares  bilinear  form  (2.4)  with  respect  to  the  norm  (2.6),  i.e.. 

(2.7)  ;^(E,H;E,H)  <  -1  (l|Elli,,B  +  IIHII;, (|lE||e,,,,  +  ||H|1.:„P’''' 

(with  pi  not  depending  on  a.’)  follows  easily  from 

jr(E.H:E,H)  <  (J^(E.H;E,H)‘p:r(E,H:E.H)S2 


and 


/•(E.H:E.H) 

=  II  -  ico-E  -  V  X  +  11  -  ?u.’H  -f  V  X  E\\l^  +  ||V  •  E\\la  +  tl^  '  ^W'o.q 

<  2  (|E|pn  +  +  u,'^llE|lo_n  -f  . 

The  Garding-type  inequality 

jr(E.H;E.H) 

>  72  (llE|iP.a  +  IIHIIPP  -  73U.''^  (||E||2.„  +  IIHIlin) 
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(with  72,73  not  depending  on  in)  follows  from 

>  l|V  X  E||i  „  +  ||V  .  E||§,a  +  in“|lE||J,„  +  ||V  x  H||g,„  +  ||V  ■  H||i„  + 
-■2u.  (||E|lo,nl|V  X  H|lo.n  +  ||H||o,o||V  x  E||„,n) 


using 

2u;||Ello.n||V  x  H||o,fj  <  HV  x  H|jo  ^  +  u;^||E|lo  , 

2a>||H||o,n|(V  X  Elto,n  <  HV  x  Ejl^ 


and  the  continuous  imbedding  of  V  and  W  in 

3.  Finite  Element  Approximation.  For  the  numerical  approximation  of  the 
solution  of  the  Maxwell  equations  (1.1)  by  the  least-squares  finite  element  meiliod, 
we  choose  appropriate  finite- dimensional  subspaces  C  V  and  W'*  C  W.  I  be 
minimization  problem  (2.3)  is  then  solved  in  these  subspaces,  i.e.,  (E'hH^'}  G  V"  x  M'  " 
are  constructed  such  that 


(3  1)  r;(E'\H'hJ)=  min  ^E^H^J) 

^  '  (E'eH'qev''xw'‘ 

holds.  With  the  associated  bilinear  form  (2.4)  this  is  equivalent  to  solving  the  variational 
problem  of  finding  (E^h  G  x  W^’  such  that 

(3.2)  ^(E'hH'hE'hH'-)  =  (J,-/..'E"  -  V  x  H^')o,n  for  all  (E.H)  G  V’  x  W'’  . 


Note  that,  under  the  assumptions  on  f>  from  [5,  Section  3.4]  which  led  us  to  (he  definition 
of  the  bilinear  form  in  x  we  are  free  to  use  standard  contiuiunis 

linear  or  bilinear  finite  elements  with  the  appropriate  boundary  conditions. 

Finite  element  approximation  results  for  this  least-squares  approach  follow  along 
similar  lines  as  for  weakly  indefinite  (Helmholtz-type)  problems  using  an  obser\-ation 
by  Schatz  [11]  (see  also  Yserentant  [13]).  Let  be  a  quasi-uniform  familv  of  Iri- 

angulations  of  Q,  and  let  V^h  W'*  be  corresponding  finite  element  spaces.  Denoting  b>- 
(E,H)  G  V  x  W  the  minimum  of  (2.1)  (which  coincides  with  the  solution  of  (1.1)).  we 
mav  choose  the  mesh-size  H  such  that  there  exists  (E^,H'*^'^)  G  x  with 


||E-E^^fc,,a  +  I|H-H^ 


llu;:l,n  ~  .2 

^  fl  43^ 


PIIL  • 


Following  the  reasoning  in  [11]  and  [13]  we  then  obtain  that,  for  (E,H)  G  V  x  W 
satisfying 

jr(E,  H;  E^,  H^)  =  0  for  all  (E^,  H^)  G  x  , 


we  have 

(3.3)  nE,H;E.H)  >  ^  (||E||^.,i,„  +  |lH|iy ,„)  . 
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This  implies  that,  on  all  sufficiently  fine  triangulations  with  h  <  H  where  H  ~  !  l^. 

-  E\  H  -  E  ~  E\  H  -  ^  (l|E  -  +  l|H  -  H'^UL.a) 

holds,  i.e..  the  least-squares  bilinear  form  of  the  error  is  an  upper  bound  for  ibe 
X  [H^ {Q.))^-norm  of  the  error. 

We  finally  mention  an  observation  in  connection  to  the  least-squares  finite  ele¬ 
ment  approach  which  is  of  practical  importance  (see  also  [S]).  Since  the  least -sepia res 
functional  is  zero  at  the  solution  (E,H)  of  (1.1).  the  functional  evaluated  at  an  ap¬ 
proximation  (E\H'^)  e  X  serves  as  an  a  posteriori  error  measure.  This  follows 
from 

Cr'(E^H^hJ) 

=  G(E^H^J)-6■'(E^H^J) 

=  J^(E-E^H-H^E-E^H-H'’•) 

which  is  a  straightforward  consequence  of  the  usual  relation  between  a  quadratic  func¬ 
tional  and  its  associated  bilinear  form.  Howe\"er,  the  functional  measures  the  error  in 
the  norm  defined  by  the  bilinear  form  and  this  is  equivalent  to  the  IP  {Q)-non\)  only 
under  the  restrictions  discussed  above. 

4.  Multilevel  Preconditioning.  One  of  the  advantages  of  this  least-squares  a|)- 
proach  is  that  the  discrete  system  is  ahvays  positive  definite  and  the  (suitahlx-  pre¬ 
conditioned)  conjugate  gradient  method  can  be  used.  The  least-squares  finite  element 
method  leads  to  a  sequence  of  systems  of  linear  equations  associated  with  a  nested 
sequence  of  finite  element  spaces.  For  such  problems,  multilevel  methods  have  shown 
to  be  extremely  efficient.  This  efficiency  comes  from  the  fact  that  the  entire  hierarchy 
of  operators  associated  with  all  levels  of  the  discretization  is  used.  As  an  atieinpl  to 
get  an  efficient  solution  method,  we  combine  the  multilevel  approach  with  Krylov  sub¬ 
space  methods.  The  basic  idea  of  the  additive  multilevel  preconditioner  lyy  Bramlde. 
Pasciak,  and  Xu  [1]  (see  also  Yserentant  [14])  is  to  use  a  specific  scaling  of  the  residuals 
associated  with  different  discretization  levels.  Consider  a  sequence  of  triangulations 
{T’"',  I  =  0. ...  ,L}  (with  /?o  —  H,  h-i  —  h).  Associated  with  each  triangulatiou  [T''-  ]  is 
the  finite  element  space  x  .  This  leads  to  a  nested  sequence  of  spaces 

yho  X  C  X  C  •  •  •  C  x  . 

On  each  level  /,  0  <  /  <  T,  an  operator  :  V'’'  x  V''^'  x  W'"'  is  defined  b>- 

((j^KE,  H);  (E,  H)))  =  H:  E,  H)  for  all  (E,  H)  G  x  W'*'  , 
where  the  inner  product  ((•;•))  is  given  by 

(((E,  H);  (E,  H)))  =  (E,  +  (E,  H)om  • 

In  terms  of  the  operator  JF;,  the  discrete  problem  (3.1)  can  be  written  as 
(4.1)  J-,(EpH,)  =  J; 
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where  the  right-hand  side  is  defined  b}-  ((J;,(E,H)))  =  -(J,— ?'u;E-  V  x  H)o.o  ior  ali 
{E,H)  e  X  W^'. 

The  basic  ingredients  for  multilevel  preconditioning  are  the  projection  operators 
Vi.  Qt  :  X  W'*  — >  X  which  are  given  by 

H);  (E,  H))  =  ;^((E,  H);  (E,  H))  for  all  {E.  H)  G  x  W'" 

and 

((Q,(E,H);(E.H)))  =  (((E.H):(E,H)))  for  all  (E,H)  G  V”'  x  W'-T 
In  this  abstract  setting,  the  additive  multilevel  preconditioner  is  equivalent  to 

1=0 

and  this  leads  to  a  condition  number  for  the  preconditioned  system  /o(C;  ') 

which  is  Imunded  independently  of  L.  For  details  on  the  implementation  see  [1]  and  H  1]. 
Tlie  framework  of  mnltile-\-el  minimal  residual  methods  developed  in  [12]  allows  e.Ktia 
flexibilit.y  in  the  scaling  of  the  components  associated  tvith  different  levels.  It-xperiinents 
with  different  tveighting  strategies  in  order  to  optimize  the  convergence  depending  on 
u,'  are  currently  under  way  and  will  be  rej:)orted  elsewhere. 

5.  Computational  Experiments.  This  section  contains  computational  eximr- 
iments  for  a  simple  two-dimensional  model  problem  to  illustrate  the  performance  ol 
the  multilevel  preconditioner  for  different  values  of  the  wa.ve  number  u;  and  mesh-size 
h.  We  consider  the  Maxwell  equations  (1.1)  with  £  =  /f  =  1  on  the  unit  scpiare 
Q  =  (0, 1)  X  (0. 1)  C  This  two-dimensional  setting  implies  that  our  unknown  rnnc- 
tions  are  E]  (,Ti  ,  :r2),  E2(a’i, .I'i)  ^md  Esf.rt.  .^^2)  and  the  corresponding  Maxwell  equations 
are 

diE2  —  82  E-i  —  =  0 

—  —  iix:E\  —  J\_ 

d,H3-iujE2  =  Ji 
diEi  +  ^2^2  =  0  . 

The  perfectly  conducting  boundary  conditions  imply  that  Ei  =  0  on  that  part  ol  the 
boundary  with  ^2  =  0  or  X2  =  1  while  E2  =  0  for  a;i  =  0  or  .tq  =  1.  The  numerical 
results  reported  here  ^vere  performed  with  the  choice  Ji  =  J2  =  l/v/2  in  Q.  \Ae 
minimize  the  least-squares  functional 

,,  ,,r;(Ea,  E2,  fh:  Ji,  J2)  =  11^1^2  ~  52 El  -  +  II  -  ^2^3  -  iioE,  -  .7,  II50 

+  —  •/u;E2  —  J2|lo,n  +  + '^2E2||5  n 

among  all  (E{\E2,E3)  G  x  V2  x  V3'  consisting  of  piecewise  bilinear  finite  elements 
on  a  triangulation  of  H  into  squares  of  side  length  h.  The  boundary  conditions  for  Ei 
and  E2  are  built  into  the  spaces  and  V^- 
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Table  o.l  shows  the  minimum  value  of  the  least-squares  functional  in  dependence 
of  h  and  u.’.  Note  that  these  numbers  serve  as  a  measure  of  the  discretization  error  as 
explained  in  Section  3.  Due  to  our  choice  of  Ji  =  J2  =  1/  we  have  G'(0. 0, 0;  . ./, )  = 

1  which  means  that  we  can  interpret  the  numbers  in  Talrle  5.1  also  as  a  measure  for  the 
relative  discretization  error.  The  minimum  values  of  the  functional  seem  to  l)ehav('  like 
0{h')  for  sufficiently  small  h. 


T.\ble  .5.1 

Minimum  value  of  the  least-squares  functional 


h  1/16 


10 

•20 


5.43  -lO-'^ 
3.64  -lO-'^ 


30 


3.61  -10-2 


10 

•50 

60 


3.61  TO-- 
3.61  10-^ 
3.61  -10-- 


1/32  1/64 


3.62 

•10-'^ 

1.73 

■10"' 

1.66 

■10-' 

6.92 

■10-’^ 

1.80 

■10-' 

9.33 

1.80 

•10-' 

8.99 

•10-' 

1.80 

■10-' 

9.02 

■10-' 

1.80 

■10-' 

9.02 

•10-' 

1/1-2S  1/-2.56 


5.66 

•10- 

~ 

1.54 

■lo-' 

2.41 

•10- 

3 

6.86 

•10-' 

4.48 

•10- 

i 

1.64 

■10-' 

4.53 

•10- 

s 

2.59 

■10-' 

4.51 

•10- 

3 

2.03 

■10-' 

4.51 

■10- 

3 

2.34 

■10-' 

Listed  in  Table  5.2  is  the  number  of  iterations  of  the  multilevel  precondif  loued  ('(i 
method  required  to  reduce  the  initial  residual  by  a  factor  of  10“'k  The  mullilew'i  pre¬ 
conditioner  uses  the  exact  solution  of  the  corresponding  problem  .4o  on  a  7  x  7  mesh 
(/?u  =  1/6).  .^'ote  that  the  value  of  the  least-squares  functional  remains  aluK.rst  un¬ 
changed.  indicating  that  discretization  accuracy  is  reached,  after  much  fewer  itera!  ions. 


Table  .5.2 
Number  of  iterations 


h 

1/16 

1/32 

1/64 

1/128 

1/2-56 

10 

17 

25 

33 

36 

37 

20 

19 

36 

51 

59 

61 

30 

17 

31 

51 

82 

100 

40 

16 

29 

48 

65 

101 

50 

14 

26 

42 

70 

91 

60 

14 

25 

40 

57 

83 

These  limited  numerical  experiments  indicate  that  multileA'el  preconditioning  leads 
to  an  efficient  solution  method  for  the  time-harmonic  Maxwell  equations.  More  com¬ 
putational  experience  involving  three-dimensional  domains  and  complicated  geometries 
are  needed  to  get  a  better  judgment  of  this  approach.  These  experiments  are  currently 
under  way  and  will  l^e  reported  elsewhere. 
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Iterative  Solution  Methods  for  Ill-Posed  Problems 

D.  Calvetti  *  L.  Reichel  ^  Q.  Zhang  ^ 


Abstract 

The  discretization  of  ill-posed  prolems  often  gives  rise  to  large  ill-conditioned  linear  systems  of 
equations  whose  solution  requires  some  form  of  regularization.  We  describe  new  solution  method 
in  which  the  filter  function,  that  determines  the  regularization  of  the  problem,  is  approximated 
by  linear  combinations  of  Chebyshev  polynomials. 


1  Introduction 

The  numerical  solution  of  ill-posed  problems  is  a  challenging  task  that  arises  frequently  in  the 
context  of  inverse  problems.  Many  inverse  problems  in  physics  and  engineering  can  be  formulated 
as  Fredholm  integral  equations  of  the  first  kind. 

The  discretization  of  Fredholm  integral  equations  of  the  first  kind  typically  yields  very  ill- 
conditioned  linear  systems  of  algebraic  equations.  When  the  dimensions  of  the  linear  system  of 
algebraic  equations  are  small  enough  to  allow  factorization  of  the  matrix,  an  approximate  solution 
of  the  system  of  equations  can  be  computed  by  algorithms  that  use  a  truncated  singular  value 
decomposition  or  other  orthogonal  transformations  of  the  matrix.  However,  many  inverse  problems 
that  are  of  interest  in  applications  give  rise  to  linear  systems  of  algebraic  equations  so  large  that 
solution  methods  requiring  the  factorization  of  the  matrix  aire  unfeasible  or  undesirable.  Such 
problems  have  to  be  solved  by  iterative  methods. 

We  describe  new  iterative  methods  for  the  solution  of  discretized  ill-posed  problems.  The  new 
methods  are  particularly  appealing  when  the  discretized  ill-posed  problem  is  very  large  and  the 
resulting  matrix  is  sparse  or  structured.  The  proposed  algorithms  do  not  require  that  the  matrix  be 
explicitly  stored,  only  a  subroutine  for  the  computation  matrix-vector  products  has  to  be  supplied 
by  the  user.  Generalizations  of  the  methods  in  this  paper  to  linear  systems  with  a  nonsymmetric, 
and  possibly  nonsquare,  matrix  are  described  by  Calvetti  et  al.  [2]. 

Let 

Ax=:g,  x,geR^,  (1) 

be  a  linear  system  of  algebraic  equations  that  stems  from  the  discretization  of  an  ill-posed  problems. 
Throughout  this  paper  we  cissume  that  the  matrix  A  be  symmetric.  Typically,  the  eigenvalues  of 

‘Depeirtment  of  Pure  and  Applied  Mathematics,  Stevens  Institute  of  Technology,  Hoboken,  NJ  07030.  E-mail: 
caJvetti@mathrsl.stevens-tech.edu 

^Depairtment  of  Mathematics  and  Computer  Science,  Kent  State  University,  Kent,  OH  44242.  E-mail: 
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^Department  of  Mathematics  and  Computer  Science,  Kent  State  University,  Kent,  OH  44242.  El-mail: 
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symmetric  matrices  arising  from  discretizing  ill-posed  problems  cluster  at  the  origin.  This  causes 
the  solution  vector  of  the  linear  systems  (1)  to  be  very  sensitive  to  perturbations  in  the  right-hand 
side  vector,  e.g.,  due  to  measurement  errors.  Surveys  on  the  treatment  and  solution  of  ill-posed 
problems  can  be  found  in  the  work  by  Engl  [3],  Groetsch  [4],  Hanke  and  Hansen  [5]. 

A  regularization  method  replaces  the  linear  system  (1)  with  a  nearby  system  whose  solution 
is  less  sensitive  to  the  presence  of  noise  in  the  right-hand  side  vector.  The  most  well-known  regu¬ 
larization  method  is  Tikhonov  regularization.  Instead  of  computing  the  solution  of  (1),  Tikhonov 
regularization  solves  the  minimization  problem 

min{||5-Aa:f +  i||Lx||2},  (2) 

x€R^  P 

where  ^  >  0  is  a  regularization  parameter  and  L  is  a  linear  functional  which  defines  a  seminom. 
For  simplicity  we  assume  that  L  =  /.  Here  and  throughout  this  paper  j|  •  ||  denotes  the  Euclidean 
vector  norm  or  the  corresponding  induced  matrix  norm.  Introduce  the  spectral  decomposition 

A  =  UAU'^, 

where 

U  =[ui,U2,.  ■  .,Un\,  U'^U  =  I, 

A  =  diag{Ai,  A2, . . . ,  A^},  Ai  <  A2  <  •  ■  •  <  A„ 

The  solution  of  (2)  satisfies 
-b  i/)x^  =  Ag, 

hence 

j=l  ^ 

where 

The  function  ^{t)  =  1  -  “filters  out”  from  the  solution  vector  contributions  along  eigenvectors 
corresponding  to  eigenvalues  of  small  magnitude.  For  this  reason  $  is  usually  referred  to  as  the 
Tikhonov  filter  function. 

The  selection  of  a  different  filter  function  determines  a  different  regularization  method.  Since 
the  purpose  of  a  filter  function  is  to  exclude  components  along  eigendirections  associated  with 
the  eigenvalues  closest  to  the  origin,  the  filter  function  usually  vanishes  at,  and  is  small  in  a 
neighborhood  of,  the  origin.  In  this  paper  we  consider  regularization  methods  defined  by  the 
Tikhonov  filter  function  and  by  the  exponential  filter  function  $(t)  =  1  -  defined  by 

The  function  (5)  is  analytic  in  the  whole  complex  plane  and  therefore  can  be  well  approximated  by 
polynomials  on  a  finite  interval.  When  \l3i^\  is  small  the  exponential  filter  function  approximates 
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closely  the  Tikhonov  filter  function.  Notice  that  the  Tikhonov  filter  function  has  singularities  at 
t  =  ±2/3“^/^,  where  i  = 

This  paper  is  organized  as  follows.  In  section  2  we  define  new  iterative  methods  based  on  the 
approximation  of  the  filter  function  by  a  linear  combination  of  Chebyshev  polynomials  of  the  first 
kind.  The  degree  of  the  approximating  polynomial  increases  by  one  at  each  iteration.  Since  the 
derivative  of  the  residual  polynomials  of  the  iterative  methods  vanishes  at  the  origin,  the  methods 
converge  to  the  minimum  norm  solution  when  the  matrix  A  is  singular  and  the  linear  system 
(1)  is  inconsistent.  Section  3  describes  how  to  compute  a  regularization  parameter  that  satisfies 
Morozov  discrepancy  principle  in  combination  with  the  iterative  methods.  Section  4  describes  how 
to  efficiently  compute  a  correction  to  the  solution  corresponding  to  a  previously  selected  value  of 
the  regularization  in  order  to  obtain  the  solution  corresponding  to  a  new  value  of  the  regularization 
parameter. 

2  Iterative  methods 

Assume  for  simplicity  that  =  0.  The  solution  of  (1)  by  an  iterative  methods  proceeds  by 
computing  a  sequence  of  iterates  of  the  form 

A:  =  l,2,...,  (6) 

where  gk-i  is  a  polynomial  of  degree  A:  —  1.  The  residual  vectors  =  9  —  satisfy 

rf^=Pk(^)9,  A  =  1,2,...,  (7) 

where  the  polynomials 

Pkit)  =  I  -  (8) 

are  generally  referred  to  as  residual  polynomials.  It  follows  from  (3)  and  (7)  that  the  polynomials 
qk~i  should  be  chosen  so  that  the  residual  polynomials  pf.  yield  a  good  approximation  of  <l>^{x) 
over  the  spectrum  of  the  matrix  A.  Furthermore,  since  the  eigenvalues  of  A  are  very  close  to  the 
origin,  it  is  desirable  that  the  derivative  of  the  residual  polynomials  vanish  at  the  origin,  i.e., 

P5t(0)  =  0.  (9) 

We  remark  that  an  iterative  method  must  satisfy  (9)  to  converge  to  the  solution  of  minimum  norm 
when  the  matrix  A  is  singular:  see  [1,  5].  Assume  that  an  interval  [a,  6]  containing  the  spectrum  of 
A  is  known.  The  Chebyshev  polynomials  of  the  first  kind  for  the  interval  [a,  b]  satisfy  the  three-term 
recurrence  relations 


ro(z(i))  =  l  Ti(a(t))  =  z(t) 

(10) 

Tkiz{t))  =  2z(t)Tk-i{z{t))  A:  =  2,..., 

2t-b-a 
6-a  ’ 

-1  <  <<  1- 

(11) 

Let 


w(t)  =  (b-t)  -  a) 
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be  the  weight  for  the  interval  [a,  b].  ^  .  t  ^ 

We  begin  with  describing  the  iterative  method  for  the  exponential  Hlter  function.  Introduce  the 

expansion  of  =  e  in  the  2^  (2 (t)), 

i=o 


ao  =  i  /  ig(f)w{t)dt, 

a,  =  1  i  =  l,2,.... 

IT  Ja 

and  denote  its  k-th  partial  sum  by  Sfc(t).  Observe  that 
-  2/3%(x)  = 

Multiply  Sfc(t)  by  -2^t  and  integrate  in  t.  This  yields  a  polynomial  qk+2{t)  of  degree  fc  +  2,  which 
can  be  written  in  the  form 

9.+2(‘)=i:wzw)+c  (1^) 

f=i 

where  the  constant  C  is  chosen  so  that  g/i+2(0)  =  1.  Furthermore 

^.(0)  =  0. 

It  can  be  shown,  see  [2],  that  the  polynomials  qj  satisfy  a  five-term  recurrence  relation  of  the  form, 
»=n-3 

The  coefficients  >  4,  can  be  computed  from  the  (n  -  l)-st  Fourier  coefficient  of  the 

expansion  of  e'^^'^in  the  Tj  according  to  the  following  formulas 

a("+«  =  -ADa„-,^  4?/'  = 

The  following  Lemma  shows  that  the  Fourier  coefficients  Ofc  satisfy  also  a  five-term  recurrence 
relation. 

Lemma  2.1  The  coefficients  of  the  expansion  of  e-^*"  in  Chebyshev  polynomials  for  the  interval 
[a,  6]  satisfy  the  following  five-term  recurrence  relations 

’  -2ao  -  2^ai  -  p^^a2  +  2^a3  -I-  04  =  0 
and  for  i>2, 

-ai_2  -  2|^o.-.i  -  -f  25±f  a,+i  +  ai+2  =  0 
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It  follows  from  Lemma  2.1  that,  for  fixed  m,  after  computing  oq,  oi,  a^-i,  Om  to  high  accuracy, 
the  remaining  m  —  3  Fourier  coefficients  can  be  obtained  by  solving  a  pentadiagonal  linear  system 
of  equations.  Therefore  the  (n  +  l)-th  approximate  solution  for  the  iterative  method  determined 
by  the  residual  polynomials  gf.  is  of  the  form 

a!“+V>,  ">4, 

where 

=  -sr:’’* 

yd)  = 

Each  iteration  requires  one  matrix-vector  multiplication  with  the  matrix  A.  We  would  like  to 
terminate  the  iterations  when  the  error  in  the  approximation  of  the  filter  function  by  the  polyno¬ 
mials  qk  is  sufficiently  small.  Since  the  coefficients  of  these  polynomials  are  related  to  the  Fourier 
coefficients  ojt  in  a  fairly  straightforward  manner,  iterations  are  terminated  when  la^,}  is  sufficiently 
small. 

We  remark  similar  iterative  schemes  can  be  defined  for  different  choices  of  the  filter  function. 
The  iterative  method  for  Tikhonov  filter  function,  for  example,  can  be  defined  simply  by  setting 
=  '^+1  •  The  following  lemma  gives  analytic  formulas  for  the  computa¬ 
tion  of  the  Fourier  coefficients  of  the  expansion  of  in  Chebyshev  polyomials  for  the  interval 

[a,  6]. 

Lemma  2.2  If 

1)2  ^  ,  t  €  [a, b] 

then 


ao 

an 


^  ^Re-  ^ 


.  -  a) 


yjz^  -  I  -f 


_  .  .\/^{b—a)  /-g - -  z 

^{b  —  a)'^  (z2  _  2  ^  —  1 


),  n  =  1, 


where  z  =  — —  i ^  and  Re(z)  denotes  the  real  part  of  a  complex  number  z. 

3  Selection  of  Regularization  Parameter 

Often  in  applications  the  right-hand  side  vector  g  in  (1)  is  contaminated  by  errors.  Separate  the 
right-hand  side  into  two  components, 

S  =  5““'+>?, 
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where  is  the  unperturbed  and  unknown  right-hand  side  vector  and  77  is  the  error.  Assume 

that  the  norm  of  the  error  satisfies 


Ikll  <  ^ 

with  the  bound  5  explicitly  known. 

We  want  to  determine  the  value  of  the  regularization  parameter  p*  which  satisfies 
/or)  =  ||«- .4x511 -i  =  0, 

This  method  for  determining  the  regularization  parameter  is  often  referred  to  as  the  Morozov 
discrepancy  principle  and  is  discussed,  e.g.,  by  Engl  [3]  and  Groetsch  [4].  Introduce  the  function 

=  (I’’) 

Then  a  solution  to  the  equation  g{P)  =  0  is  also  a  solution  of  /(/3)  =  0.  Since  g{P)  is  a  decreasing 
and  convex  function  of  p,  Newton’s  method  is  globally  and  monotonically  convergent  when  starting 

with  a  value  po  <  P*".  ,  ,  •  i.  f 

The  iterates  computed  by  Newton’s  methods  can  be  expressed  in  a  particularly  convenient  torm 

in  the  case  of  the  exponential  filter  function 


Pn+l  =Pn  + 


where  =g-  Ax^^.  Therefore  the  computation  of  a  new  value  of  the  regularization  parameter, 
say  does  not  require  any  additional  work  in  addition  to  the  computation  of  xp^  by  the  iterative 
method  described  in  section  2. 

Unfortunately  the  iterates  for  Newton’s  methods  cannot  be  simplified  to  such  efficient  formu¬ 
lation  when  the  regularization  is  determined  by  Tikhonov  filter  function.  In  fact,  in  view  of  the 
additional  computations  which  would  be  required  to  apply  Newton’s  method,  it  is  more  convenient 
to  determine  the  value  of  p  which  satisfies  Morozov  discrepancy  principle  by  the  secant  method, 
whose  global  convergence  when  starting  with  two  values,  say  p-i  and  /3o,  less  than  /?*.  The  iterates 
for  the  secant  method  can  be  written  in  the  form 


Pn-\-\  =  /3n  + 


4  Efficient  update  of  the  solution  vector 

We  now  describe  how  to  computed  a  correction  of  the  vector  which  yields  the  solution  a;/3„+, 
corresponding  to  a  a  new  value  of  the  regularization  parameter  /3n+i- 
Let  and  introduce 

A73  =  ^n+l  ~  Pn  • 


Then 

where  is  the  Moore-Penrose  generalized  inverse  of  A.  Since 


(18) 
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=  (/-e  {/3n+A/3)A2^^t^. 

=  (/  -  e-^’^^^)A^g  +  {I  - 


where  =  g~  Ax0^,  to  compute  it  suffices  to  compute  the  quantity  (/  —  e~^^'^‘)j^r0^ , 
for  example  by  the  technique  described  in  section  2.  We  remark  that,  when  /?„+!  is  close  to  a 
zero  of  5(/3),  the  approximation  of  requires  a  polynomial  qk{t)  of  lower  degree  than  when 

approximating 

The  updating  scheme  for  the  solution  vector  when  a  new  regularization  parameter  has  been 
determined  in  the  case  were  the  regularization  is  determined  by  Tikhonov  filter  function  is  based 
on  the  following  observation.  Since 


=  {!-l,en+lA^  +  ir')A^g 
=  +(I-  W.A^  +  I)Wn+lA^  +  ir^)A'T.x, 

we  can  compute  the  correction  term  to  X0^  in  the  previous  formula  by  defining  an  iterative 
method  whose  residual  polynomials  qk[t)  approximate  the  function  It  can  be  shown  that, 

as  the  distance  between  /3„  and  /3n+i  becomes  smaller,  this  updating  procedure  become  faster  that 
computing  by  approximating 
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Abstract 

We  discuss  several  techniques  for  finding  leading  eigenvalues  and  eigenvectors  for  large  sparse  matrices.  The 
techniques  are  demonstrated  on  a  scalar  Helmholtz  equation  derived  from  a  model  semiconductor  rib  waveguide 
problem.  We  compare  the  simple  inverse  iteration  approach  with  more  sophisticated  methods,  including  min¬ 
imum  degree  reordering,  Arnoldi  and  Lanczos  methods.  We  then  propose  a  new  Amoldi  method  designed 
particularly  for  the  constrained  generalized  eigenvalue  problem,  a  formulation  arising  naturally  from  the  scalar 
waveguide  problem. 


1  Problem  Formulation 

In  the  waveguide  analysis  (see  e.g.  [9]),  the  analysis  of  the  propagation  of  the  electric  and  magnetic  fields  in 
waveguides,  based  on  the  use  of  Maxwell’s  equations,  often  lead  to  scalar  Helmholtz  equations  of  the  general  form 
y2  ^  p’Ex  =  0^Bx-  where  Ex  denotes  the  x  component  of  the  electric  field,  k  is  the  dielectric  constant,  /3  is 
the  unknown  Vopagation  constant,  and  denotes  the  Laplacian  operator  in  the  transverse  {x,y)  coordinates  (z 
being  the  longitudinal  coordinate).  When  this  Helmholtz  equation  is  solved  for  the  i  coordinate  only,  one  nmst 
impose  internal  continuity  conditions  across  the  boundaries  between  materials  of  different  dielectric  constants.  The 
combined  equations  are  then  discretized  by  finite  differences  yielding  a  large  sparse  ordinary  matrix  eigenvalue 
problem  Ax  =  Ax  [9].  Discretizing  the  vector  fields  equations  using  finite  element  formulations  leads  to  a  large 
sparse  generalized  eigenvalue  problem  [2]; 

Ax  =  AHx  (1) 

where  the  matrix  A”"""  is  real  and  non-singular,  H"’'"  is  real,  symmetric,  and  positive  definite. 

Solving  problem  (1)  constitutes  the  largest  part  of  the  computational  effort.  In  this  paper  we  are  discussing 
several  iterative  algorithms  that  can  efficiently  deal  with  problem  (1)  of  several  thousands  in  size  on  worksta¬ 
tions.  When  B  is  banded,  as  is  the  case  in  many  problem  formulations  from  waveguide  analysis,  the  Cholesky 
decomposition  B  =  Li’’  problem  (1)  can  be  used  to  reduce  (1)  to  an  ordinary  eigenvalue  problem; 

L~^AL~'^y  =  Ay  (2) 

where  L  is  lower  triangular  and  y  =  i’’x.  In  an  iterative  method,  i-ML“’’  dose  not  need  to  be  explicitly 
formed.  This  is  possible  because  most  iterative  methods  only  need  to  form  matrix-vector  products.  When  B  is 
not  banded  but  its  Cholesky  decomposition  gives  a  sparse  i,  the  reduction  is  still  useful.  But  if  L  is  not  sparse, 
then  solving  problems  such  as  (1)  becomes  significantly  more  difficult  and  expensive.  Since  waveguide  analysis 
leads  to  generally  banded  matrices,  in  this  paper,  we  will  mainly  concentrate  on  the  ordinary  eigenvalue  problem 
(2),  assuming  that  the  multiplication  by  A“^  is  feasible,  and  considering  the  matrix  L^^AL-'^  as  A,  if  appropriate. 

As  a  model  test  case,  we  derived  the  matrix  A  from  the  scalar  Helmholtz  equation,  combined  with  suitable 
internal  continuity  conditions,  applied  to  a  typical  semiconductor  rib  waveguide.  See  [9]  for  the  details.  In  our 
examples,  we  used  a  matrix  of  modest  size  (1800  x  1800)  with  a  zero  structure  shown  in  Figure  1.  Computations 

were  carried  out  in  MATLAB. _ 

'This  work  was  funded  in  part  by  NSF  grant  CCR-9405380. 


645 


2  Inverse  Iteration,  Minimum  Degree  Reordering  and  Deflation 

Inverse  iteration  is  a  simple,  powerful  and  effective  method  to  compute  the  smallest  modulus  eigenvalue.  Since 
in  the  waveguide  analysis  the  dominant  mode  (in  the  sense  of  largest  real  part),  which  is  desired,  generally  does 
not  correspond  to  the  largest  or  smallest  eigenvalue  in  modulus,  it  is  often  necessary  to  combine  a  shift  cr  into 
the  iteration.  Iterating  with  matrix  {A  —  cr7)~^  will  yield  the  eigenvalue  closest  to  the  given  a.  The  convergence 
factor  is: 


P  = 


1-^1  - 

IA2  -ct| 


(3) 


where  Ai  is  the  closet  eigenvalue  to  tr  and  A2  is  the  second  close  one.  We  can  see  that  if  cr  is  much  closer  to  Ai 
than  to  Aa,  the  convergence  can  be  very  fast.  Each  iteration  requires  that  a  system  of  linear  equations  of  the 
form  {A  —  cr/)xne„  =  Xoid  be  solved,  often  by  a  variant  of  LU  Factorization  [4].  If  a  is  updated  using  Rayleigh 
quotients  during  the  iteration,  then  the  iteration  converges  quadratically  in  general,  and  cubically  if  the  matrix 
is  symmetric  [4],  but  changing  a  requires  that  {A  -  a  I)  be  re-factored  from  scratch  in  each  iteration  at  great 
expense.  So  often  the  shift  is  fixed,  allowing  the  factoring  to  be  done  just  once,  where  the  shift  is  estimated  in 
advance  by  physical  considerations.  On  the  other  hand,  when  only  the  largest  eigenvalue  in  magnitude  is  needed, 
the  more  straightforward  power  iteration  may  be  used,  but  this  method  may  suffer  from  slow  convergence  when 
the  first  and  second  eigenvalues  in  magnitude  are  close  to  each  other.  The  Inverse  Iteration  is  summarized  in  the 
following. 


Algorithm  2.1  Basic  Inverse  Iteration: 

1.  Choose  a  random  unit  vector  xo  and  a  shift  cr; 

2.  Compute  the  LC  decomposition  of  A  —  cr/; 

3.  Iterate:  for  i  =  1, 2, ...  until  convergence  do 

4.  Solve  Ly  =  xq  and  Uk  =  y; 

5.  norm  =  ||xjj2,  X  =  x/norm; 

6.  Compute  A  =  x’^Ax  =  (x^xo)/norm; 

7.  Compute  residual  r  =  l|Ax  -  Ax[|oo; 

8.  If  r  is  satisfied,  then  stop;  Otherwise  xq  =  x; 

9.  Optional  If  r  =  0  (mod  8  or  10),  then  set  cr  =  A, 
and  compute  the  LU  decomposition  of  (A  —  cr/). 

{optional:  for  Rayleigh  quotient  iteration  adaptive  algorithm) 

10.  end 


Line  9  is  an  adaptive  shift  step  for  faster  convergence  which  can  yield  quadratic  convergence  (cubic  if  A  is 
symmetric).  However,  it  requires  that  the  LU  decomposition  be  repreated,  which  is  very  expensive,  and  especially 
when  A  is  not  symmetric,  the  converged  eigenvalue  may  not  be  the  one  that  is  closest  to  the  original  shift.  Thus 
often  line  9  is  skipped,  or  performed  only  once  during  several  passes  through  the  main  loop. 

If  the  matrix  A  is  banded,  we  can  reorder  A  using  the  minimum  degree  ordering  before  we  do  the  LU  decomposi¬ 
tion  [3].  This  is  a  heuristic  algorithm  that  reorders  the  equations  so  that  (hopefully)  the  resulting  L  and  U  factors 
are  more  sparse,  i.e,  have  fewer  non-zero  elements.  Then  the  total  number  of  flops  needed  by  the  algorithm  will  be 
considerably  reduced.  For  the  details  of  minimum  degree  reordering,  please  refer  to  [3].  For  these  experiments,  we 
have  taken  advantage  of  MATLAB’s  built-in  functions  for  various  reordering  algorithms  including  the  minimum 
degree  reordering. 

Algorithm  2.2  Inverse  Iteration  with  Minimum  Degree  Reordering: 

1.  Choose  a  random  unit  vector  Xq  and  a  shift  cr; 

2.  Find  the  minimum  degree  reordering  permutation  P,  replace  A  with  PAP; 

3.  Compute  the  LU  decomposition  of  A  -  cr/; 

4.  The  rest  is  the  same  as  the  Basic  Inverse  Iteration  (Algorithm  2.1,  steps  3-8). 

With  MATLAB  we  carried  out  a  series  of  numerical  experiments  on  a  1800  x  1800  matrix  described  in  Section 
1.  Algorithm  2.1  and  2.2  were  used  respectively  to  find  the  dominant  mode  which  corresponds  to  the  eigenvalue 
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Inverse  Iteration 

#  Eigenvalues  Sought 

Iters 

Flop  Count 

Residual  Norm 

Basic 

1 

4 

8,255,268 

6.4E-5 

Reordering 

1 

4  ^ 

5,225,559 

7.4E-5 

Reordering 

2 

18 

8.485,461  1 

3.9E-5 

Table  1:  Comparison  of  Basic  and  Reordering  Inverse  Iteration 


A  =  3.4404,  fixing  the  shift  at  a  =  3.4.  Prom  Table  1  we  can  see  that  reordered  inverse  iteration  can  save  over  1/3 
of  computation  needed  by  the  basic  inverse  iteration.  We  should  point  out  that  in  MATLAB  the  LU  decomposition 
process  may  involve  pivoting  and  this  pivoting  may  reduce  some  of  the  advantage  gained  from  reordering.  To 
see  the  potential  advantage  that  could  be  gained  just  from  minimum  degree  ordering  if  pivoting  is  turned  off,  we 
computed  the  flop  count  for  the  Cholesky  factorization  on  a  shifted  matrix  A  (like  LU  Decomposition  without 
pivoting  for  symmetric  positive  definite  matrices  [4]).  The  resulting  Flop  Counts  with  and  without  minimum 
degree  reordering  are  2,013,624  and  3,685,377,  respectively,  representing  a  savings  of  45%  over  the  unreordered 
version! 

If  we  have  already  obtained  the  leading  eigenvalue  and  need  to  find  the  next  one  closest  to  the  known  eigenvalue, 
we  can  use  the  so  called  deflatim  technique,  described  as  follows  [8],  Suppose  A  and  u  are  the  known  eigenpair 
of  A,  and  v  is  a  vector  such  that  v^u  =  1.  Then  it  is  easy  to  see  that  the  matrix  A'  =  A  -  ■yuv'^  has  the  same 
spectrum  as  A  except  that  the  one  eigenvalue  A  has  been  shifted  to  A  —  7.  For  the  power  iteration  A  can  be  used 
directly  in  the  iteration.  But  we  cannot  do  so  in  the  case  of  the  inverse  iteration,  because  forming  A  explicitlj 
is  too  expensive  and  will  destroy  the  sparsity.  We  here  propose  an  efficient  process  that  avoids  forming  A  .  Since 
what  the  inverse  iteration  needs  is  the  matrix-vector  product  x  =  A'^^Xq-  We  express  A'~^  in  terms  of  A~\ 
whose  LU  decomposition  is  already  available.  By  the  Sherman-Morrison  formula  [4]  and  using  A-^u  =  A-^u,  we 
have 

=  A“^ -I-7A''^u(1 -7v^A“^u)~^v^A“^ 

=  A~^  + 

A-7 

=  (/  -h  Puv'^)A~^  (4) 

where  j3  =  Since  we  already  have  LU  decomposition  of  A,  we  can  first  compute  y  =  A"^xo,  and  then 

compute  the  product  x  =  A'-^Xq  =  y  +  /3(v^y)u.  Combining  this  into  Algorithm  2.1  and  2.2  is  the  inverse 
iteration  with  deflation.  We  would  like  to  point  out  here  that  this  deflation  technique  can  be  cascaded  multiple 
time,  or  generalized  to  the  block  version,  e.g.  deflation  of  multiple  eigenvalues  at  the  same  time,  by  using  the 
Sherman-Morrison- Woodbury  formula  [4]. 

There  are  various  choices  for  the  vector  v.  The  Hotelling’s  deflation  chooses  v  as  the  left  eigenvector  w,  e.g. 
w^A  =  Aw^  or  A^w  =  Aw.  The  advantage  of  this  choice  is  that  A'  will  have  the  same  eigenvectors  as  A.  So 
w'hen  A  is  symmetric,  the  right  cind  left  eigenvectors  are  the  same,  we  can  more  easily  compute  the  product; 

X  =  A'^^xo 

=  {I  +  0uv'^)A~^xo 

=  y-l- (A  -  7)~^(u^xo)u  (5) 

where  y  =  A^^Xq.  The  other  common  choice  just  takes  v  =  u.  But  this  choice  will  not  preserve  the  eigenvectors. 
So  after  the  eigenvalue  is  found,  we  need  another  iteration  to  determine  the  eigenvector. 

The  other  strategy  to  compute  more  than  one  eigenpair  is  block  inverse  iteration  (also  known  as  inverse 
simultaneous  iteration).  In  this  iteration,  we  choose  a  shift  a  and  several  random  vectors  at  the  start,  say 
Xo  =  [xi,X2,  and  then  for  i  =  1,2, ...  until  convergence,  iteratively  compute: 


II 

1 

T 

=  QiRi 

ATi+i 

=  Qi 

(6) 
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Equation  (6)  is  the  result  of  the  QR  decomposition  in  the  previous  line  which  orthogonalizes  the  columns  of  Xi. 
If  the  iteration  converges,  then  the  upper  triangular  matrix  Rk  will  tend  to  be  the  leading  kx  k  part  of  the  Schur 
canonical  form  of  (A  —  crl)~^  [8],  that  is  the  diagonal  elements  of  Rk  will  be  close  to  the  first  k  eigenvalues  of 
(A  —  al)~^  in  order  of  magnitude.  So  the  eigenvalues  nearest  to  a  are  obtained.  The  matrix  Qk  wall  be  close  to 
the  corresponding  Schur  vectors.  If  y  is  a  eigenvector  of  Rk,  then  the  Ritz  vector  x  =  Qky  is  an  approximate 
eigenvector  of  A. 

Algorithm  2.3  Block  Inverse  Iteration: 

1.  Choose  a  set  of  random  vectors  Xq  =  [xi,X2,  and  a  shift  cr; 

2.  Compute  the  LU  decomposition  of  A  -  cr/; 

3.  Iterate;  for  i  =  1,2, ...  until  convergent  do 

4.  Solve  LY  =  Xq  and  UX  =  Xq-, 

5.  Compute  QR  decomposition  X  =  QR-, 

6.  If  i  =  0  mod  k, 

7.  Compute  the  residual  norm  \\AQ  -  QRH  =  \\AQ  -  Ajl. 

8.  To  compute  AQ,  solve  LZ  =  Q  and  UY  ~  Z. 

9.  If  satisfied,  then  stop.  Otherwise,  Xo  =  Q  and  go  on  iteration; 

7.  end 

8.  Compute  the  eigenvector  y  of  R,  and  the  eigenvector  of  A  is  x  =  Qy. 

Note  that  the  number  m  of  iterated  vectors  should  be  leirger  than  the  number  k  of  eigenvalues  needed.  Depending 
on  the  spectrum  distribution,  normally  we  could  take  m  =  fc  +  2  if  /c  is  two  or  three. 

3  Arnoldi  and  Lanczos  Method 

Arnoldi  method  for  the  algebraic  eigenvalue  problem  is  very  much  like  Galerkin  method  for  the  approximation 
of  eigenfunctions.  Here  we  just  give  a  brief  derivation  of  the  method.  The  Galerkin  method  uses  a  subspace  to 
form  the  approximate  eigenfunctions.  In  a  similar  way,  the  Arnoldi  method  takes  a  subspace  X  C  /?"  to  extract 
approximate  eigenvectors.  The  subspace  is  chosen  to  be  the  Krylov  space  of  the  form  X  =  {vq,  Avq,  ...,  A'""^  vq}, 
where  vq  is  an  arbitrary  vector  and  m  is  typically  much  smaller  than  n.  Suppose  we  have  a  set  of  orthonormal 
vectors  v;  that  form  the  basis  of  X,  and  denote  matrix  Vm  =  [vi,V2,  ...,v^].  Take  x  =  V-my  €  X,  and  impose  the 
Galerkin  condition  to  the  residual  r(A,x): 


r(A,  x)  =  Ax  -  Ax  1 


(7) 


which  is  equivalent  to: 


(AV-,„y-AE;ny,Vi)  =  0,i  =  1,2,  ...,  m  (8) 

Write  (8)  in  the  matrix  form  and  denote  =  V^AV,  we  then  obtain  a  reduced  approximation  problem  of 
smaller  size: 


ffmy  =  Ay  (9) 

Now  the  question  is  how  to  form  the  orthonormal  basis  Vj  of  X.  Here,  the  modified  Gram-Schmidt  orthogonaliza- 
tion  process  can  be  used.  Each  Avj  is  generated,  it  is  orthogonalized  against  all  Vi  with  i  <  j.  It  is  now  known 
that  this  reduced  problem  can  indeed  give  good  approximation  to  large  modulus  eigenvalues  [7].  Below  is  the 
algorithm. 

Algorithm  3.1  Arnoldi  Algorithm: 

1.  Choose  an  random  unit  vector  vi ; 

2.  Iterate:  for  j  =  1,2, ...,  m  do 

3.  Compute  w_,-  =  Avj; 

4.  For  i  =  1,2,  ...,j  do 

5.  hy  =  (v.-,wy); 
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Wj  =  Vfj  -  hijVf, 


6. 

7.  end 

8.  hj+ij  =  ||Wj||.  If  hj+i,j  =  0,  then  go  to  line  11; 

9.  Vj+i  = 

10.  end 

11.  Compute  all  the  eigenpairs  A,  y  of 

12.  Select  the  first  k  A’s  in  order  of  magnitude,  and  compute  the  eigenvectors  Xj  =  Vyi{i  =  1, 2,  ...k)\ 

13.  Compute  the  residual  ||Axt  -  AjXiHoo  =  If  satisfied,  then  stop; 

Otherwise  increase  m  by  5  or  10,  set  vq  =  xi  and  restart  iteration  from  line  2  again. 

In  line  13  the  residual  expression  is  obtained  by  multiplying  yi,  the  eigenvector  of  Hm,  to  the  following  equation, 
which  comes  straight  from  line  2  to  9. 

In  line  8,  if  hj+i.j  =  0,  then  equation  (10)  will  becomes 

AVj  =  (11) 

The  above  equation  indicates  the  exact  invariant  subspace  Vj  of  A  is  obtained,  and  therefore  the  eigenvalues  from 
line  11  will  be  the  exact  eigenvalues  of  A.  In  practice,  this  is  a  rare  case.  The  choice  of  m  depends  on  the  problem 
size,  spectrum  distribution  and  the  number  of  eigenvalues  wanted.  For  a  matrix  of  several  thousands  in  size, 
choosing  m  =  20  ~  200  may  be  good  enough  if  about  first  4  ~  10  top  modulus  eigenvalues  are  wanted. 

The  shift  and  inverse  strategy  as  previously  described  for  the  inverse  iteration  can  also  be  naturally  combine 
into  Algorithm  2.1.  In  case  the  eigenvalues  near  a  given  point  are  wanted,  this  can  be  the  winning  strategy,  as  the 
numerical  experiment  shows. 

Algorithm  3.2  Shift  and  Inverse  Arnoldi  Algorithm: 

1.  Choose  an  random  unit  vector  vi  and  shift  tr; 

2.  Compute  the  LU  decomposition  of  [A  -  al); 

3.  Iterate:  for  j  -  1,2, ...,  m  do 

4.  Solve  Ly  =  Vj  and  I7w  =  y,  =  w; 

5.  Continue  as  in  Algorithm  3.1,  steps  4-9. 

6.  Compute  eigenvalues  and  vectors  of  [A  —  as  in  Algorithm  3.1,  steps  11-12, 
mapping  the  eigenvalues  back  to  those  of  A. 

While  the  Arnoldi  method  reduces  the  large  matrix  A  into  a  small  upper  Hessenberg  matrix  Hm ,  the  Lanczos 
method,  however,  reduces  the  matrix  into  a  small  tridiagonal  matrix,  and  solve  the  tridiagonal  matrix  for  eigen¬ 
values  as  the  approximation  [1].  In  the  Lanczos  method  the  matrix  Ki  will  not  be  orthonormal.  Instead,  another 
n  X  m  matrix  will  be  formed  such  that  W^Vm  =  Im  and  W^AVm  =  Tm,  where  Tm  is  the  tridiagonal  matrix. 
The  shift  and  inverse  strategy  can  be  incorporated  into  the  Lanczos  as  Algorithm  3.2. 

Algorithm  3.3  Lanczos  Algorithm 

1.  Choose  two  vectors  vi  and  wj  such  that  (vi,  w^)  =  1  Set  =  0,vo  =  wq  =  0 

2.  Iterate:  for  i  =  1,2,  ...,m  do 

3.  Oi  =  (Avi.Wf); 

4.  V  =  Avi  -  OjVi  -  PiVi-i ; 

5.  w  =  —  cciWi  -  ; 

6.  ^i+i  =  Kv,w)|i/2. 

7.  A+i  =  (v,w)/5i+i; 

8.  Vi+i-v/pi+i; 

9.  Wi+i 

10.  end 

11.  Compute  the  eigen  pairs  of  Tm  =  [Si,ai,Pi]-, 

12.  Select  the  first  k  eigenvalues  Xi  in  order  of  their  magnitude; 
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Iterative  Method 

k 

m 

Flop  Count 

Residual  Norm 

Arnoldi 

4 

15 

4.1E-4 

Lanczos 

4 

17 

12,365,568 

1.5E-5 

Block  Inverse  itr. 

3 

’  n.a. 

53,560,729 

4.4E-5 

Table  2:  Comparison  of  FLOPS  by  Arnoldi  and  Lanczos.  k  is  the  number  of  eigenvalues  sought,  and  m  is  the 
number  of  vectors  generated. 

13.  Compute  the  corresponding  eigenvectors  Xj  =  ymyi; 

14.  Compute  the  residual  norm  j|ylxj  —  AtXt||oo; 

15.  If  satisfied,  then  stop.  Otherwise,  increase  m  and  start  over  again. 

We  used  Arnoldi,  Lanczos  and  block  inverse  iteration  to  solve  the  same  problem  as  in  Section  2.  Shift  and 
inverse  scheme  and  the  minimum  degree  ordering  were  all  combined  in  each  algorithm.  And  five  eigenvalues 
closest  to  the  shift  <t  =  3.4  were  computed.  The  results  are  A  :  3.4408,3.5665,3.1464,3.7539, 2.8379.  Table  2  gives 
the  comparison  of  flops.  It  is  apparent  that  the  Arnoldi  and  Lanczos  methods  are  capable  of  computing  more 
eigenvalues  with  less  cost  than  block  inverse  iteration,  but  inverse  iteration  may  suffice  if  just  one  eigenvalue 
is  desired,  depending  on  the  distribution  of  the  spectrum.  When  the  matrix  is  symmetric,  the  the  Arnoldi  and 
Lanczos  methods  reduce  to  the  same  algorithm,  and  the  resulting  algorithm  is  known  by  the  KanieLPaige  theory  to 
have  very  nice  convergence  properties,  at  least  for  the  extreme  eigenvalues  [4].  When  the  matrix  is  nonsymmetric, 
the  Arnoldi  method  is  more  straightforward  to  implement  and  has  somewhat  better  convergence  properties,  but 
requires  more  space  and  cost  per  iteration  than  the  Lanczos  method.  In  addition,  it  is  not  entirely  a  simple  matter 
to  compute  the  eigenvalues  of  the  nonsymmetric  tridiagonal  matrix  resulting  from  the  Lanczos  method.  Deflation 
can  be  incorporated  into  both  Arnoldi  and  Lanczos  methods. 

4  Constrained  Raleigh  Quotient  Maximization  Problems 

In  the  transverse  magnetic  field  formulation  using  finite  element  method  [5],  the  following  problem  is  encountered: 

Find  (A,x)  s.t.  A=  max  (12) 

c^x=o. 

where  A  is  symmetric,  B  is  symmetric  positive  definite,  and  C  has  full  column  rank.  This  problem  would  be 
the  usual  generalized  eigenvalue  problem  if  there  were  no  constraint  C'^x  —  0.  We  call  (12)  Constrained  Raleigh 
Quotient  Maximization  Problem(CRQM).  Mathematically,  (12)  is  equivalent  to  the  following  regular  eigenvalue 
problem: 


V'^AVy  =  V^BVy  (13) 

where  the  matrix  V  consists  of  an  orthonormal  basis  spanning  the  constraint  space  {x  :  C^x  =  0).  In  case  that 
the  matrix  size  is  small,  we  can  determine  the  matrix  V  by  the  Singular  Value  Decomposition  and  solve  the  dense 
problem  (13).  However,  if  the  matrix  size  is  as  large  as  several  thousands,  this  approach  becomes  computationally 
intractable  on  workstations.  Here,  we  propose  an  algorithm  that  can  efficiently  deal  with  large  sparse  case  of 
CRQM. 

The  algorithm  is  based  on  the  Arnoldi.  The  regular  Arnoldi  forms  an  orthonormal  basis  of  the  Krylov  space 
1C.  As  each  new  basis  vector  Avj  is  generated,  it  is  orthogonalized  against  all  Vj,  1  <  t  <  j  -  1.  Now  we  modify 
this  process  by  making  Avj  orthogonal  not  only  to  all  previous  Vj’s  but  also  orthogonal  to  all  the  column  vectors 
of  C.  The  column  vectors  of  the  resulting  matrix  Vm  will  be  the  basis  of  a  “projected  Krylov  space”  and  will  be 
very  good  components  to  form  approximate  solution  vectors  for  CRQM. 


650 


Algorithm  4.1  Arnoldi  Algorithm  for  CRQM; 

1.  Choose  a  unit  vector  vi  such  that  =  0; 

2.  For  j  =  1,2,  ...,m  do 

3.  Compute  Wj  =  Avj; 

4.  For  i  =  1, 2, k  do 

5.  t»j  =  (ci,Wj); 

6.  Wj  =  Wj  -  tij Ci] 

7.  end 

8.  For  i  =  1,2, j  do 

9.  feij  =  (vi,Wj); 

10.  Wj  =  Wj  -  hijVi; 

11.  end 

12.  hj+ij  -  ||wj||2.  If  /ij+ij  =  0,  then  go  to  14; 

13.  Vj+i  =y'j/hj+i,j. 

14.  Compute  the  eigenvalues  of  Hm  and  choose  the  k  largest  AjS,  where  k  <m. 

15.  Compute  eigenvectors  yj  of  Hm  associated  with  Aj,  and  the  CRQM  solution  vector  Xj  =  Vmyi,  i  —  I,  ■  ■  ■  ,k. 

We  have  the  following  relations  for  the  above  algorithm. 

=  VmHm  +  CTm  +  hm+l,mVm+iel  (14) 

V^AVm  =  Hm  (15) 

C^Vm  =  0  (16) 

Notice  that  equation  (15)  lays  the  groundwork  for  the  algorithm  that  the  Ritz  vector  x  =  y„,y  is  an  approximation 
solution  of  CRQM. 

It  is  seen  that  the  Arnoldi  method  is  easily  adapted  to  incorporate  the  constraints.  However,  this  method  is  still 
under  investigation,  and  though  we  expect  the  behavior  to  be  very  similar  to  the  unconstrained  Arnoldi  method, 
further  numerical  experiments  are  needed.  In  addition,  the  convergence  theory  has  not  yet  been  developed  for 
this  specific  algorithm.  This  work  is  in  progress  and  will  be  reported  soon. 

5  Conclusions 

Arnold!  and  Lanczos  methods  are  promising  for  the  large  sparse  eigenvalue  problems  in  waveguide  analysis.  The 
computation  cost  of  these  two  methods  is  far  less  than  that  of  inverse  iteration,  especially  when  several  eigenvalues 
are  sought.  Strategies  such  as  reordering,  or  “shift  and  inverse,”  can  be  easily  combined  in  these  methods.  And 
either  of  them  can  be  modified  to  fit  new  formulations  of  finite  element  method.  Reordering  technique  can 
significantly  save  computation,  and  should  be  explored  whenever  possible.  We  have  also  found  MATLAB  to  be  a 
convenient  and  efficient  tool  for  problems  with  size  reaching  of  several  thousands. 
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Original  Matrix  LU  Factors  of  Original  A  with  pivoting  disabled 


Reordered  Matrix  LU  Factors  for  Reordered  Matrix 


Figurel.  Matrix  Non-Zeroes  showing  original  matrix  and  LU  factors  combined.  Upper  figures  show  original  order, 
lower  figures  show  minimum  degree  ordering,  nz  is  the  total  number  of  nonzeroes. 
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Abstract 

The  method  of  moments  leads  to  a  linear  equation  system  that  is  usually  solved  by  an  Lb 
decomposition.  Iterative  methods  are  an  alternative,  but  need  an  efficient  preconditioner  in 
general.  We  will  show  that  there  is  a  preconditioner  for  special  configurations,  which  comes 
up  from  the  interpretation  of  the  system  matrix  in  a  simple  manner.  Together  with  a  powerful 
iterative  method  (GMRES),  this  approach  is  clearly  superior  to  the  direct  method. 


1  Introduction 

The  method  of  moments  is  an  efficient  system  for  treating  scattering  and  radiation  problems  m 
configurations  of  complex  geometry.  The  method  leads  to  the  solution  of  a  linear  equation  systern 
that  may  contain  a  high  number  n  of  unknowns.  Solving  this  equation  system  by  the  classical 
direct  methods  requires  a  computational  effort  of  order  vP .  Looking  closer  to  the  structure  of  the 
system  matrix,  one  observes  that  dependent  on  the  geometry  of  the  scattering  problem  the  system 
matrix  has  a  pronounced  block  structure  and  is  not  necessarily  completely  filled.  This  behaviour 
becomes  very  evident  if  a  double-current  approach  for  dielectric  or  magnetic  boundaries  is  used 
where  each  part  of  the  current  determines  the  field  in  one  part  of  the  space  (left  or  right  hand 
side  of  the  boundary).  Considering  geometries  that  consist  of  several  galvanically  not  coupled 
substructures,  one  gets  e.g.  a  system  matrix  Z  with  diagonal  blocks  which  correspond  to  the 
interactions  of  the  equivalent  currents  on  the  substructures,  and  non-diagonal  blocks  ^  y, 

which  correspond  to  the  mutual  interactions  of  the  substructures.  The  question  occurs,  whether 
this  structure  could  be  used  to  reduce  the  numerical  effort  in  solving  the  equation  system. 

Particular  versions  of  direct  methods,  which  make  use  of  this  matrix  structure,  have  the 
disadvantage  that  they  only  reduce  the  computation  time  of  a  very  limited  class  of  problems. 
Concerning  the  geometry,  an  appropriate  solver  has  to  be  chosen  that  may  also  need  a  multitude 
of  information  on  the  matrix  structure  that  have  to  be  qualified  in  advance.  By  this  procedure 
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Figure  1:  N  scatterers  in  free  space 


the  methods  lose  their  "black-box  character”.  There  are  a  lot  of  reasons  to  look  for  more  flexible 
and  simple  solvers. 

Iterative  methods  are  an  alternative.  The  choice  of  the  iterative  method  is  determined  by  the 
mathematical  characteristics  of  the  system  matrix.  A  suitable  method  for  non-hermitian  matrices 
is  GMRES(m).  Past  experience  has  shown  that  a  preconditioning  of  the  system  matrix  will  be 
necessary,  especially  if  the  s^'stem  matrix  is  very  ill-conditioned. 

In  the  above-mentioned  case  of  galvanically  not  coupled  substructures  a  block  .lacobi  method 
presents  itself  as  a  preconditioner,  where  the  blocks  of  the  preconditioner  correspond  to  the  diag¬ 
onal  blocks  Zii  of  the  system  matrix.  This  means  that  the  mutual  interaction  of  the  substructures 
is  neglected.  This  interaction  is  later  considered  by  using  the  iterative  method.  Moreover,  the  ad¬ 
vantages  of  .Jacobi  methods  are  that  they  need  only  little  storage  and  few  additional  information, 
and  that  they  are  easily  to  be  implemented  and  to  parallelized. 

In  the  following  section  it  is  briefly  dealt  with  the  underlying  treatment  of  the  scattering 
problem  by  equivalent  surface  currents  and  with  the  solution  of  the  electric  field  integral  equation 
EFIE  by  means  of  the  method  of  moments.  Afterwards  the  iterative  method  GMRES  is  introduced 
and  it  is  illustrated,  how  the  preconditioner  is  resulting  from  the  interpretation  of  the  single  blocks 
of  the  impendance  matrix.  Finally  some  examples  will  be  demonstrated  and  the  computing  times 
of  the  iterative  methode  will  be  compared  to  those  of  the  LU  decomposition. 


2  Integral  Equations,  Method  of  Moments 

Let  there  be  a  number  N  of  scatterers  in  free  space.  Each  of  these  scatterers  is  described  by 
its  surface  Fi-,  which  may  either  be  open  or  closed.  In  the  case  of  a  closed  surface,  the  encircled 
volume  can  be  filled  with  a  dielectric  material,  which  is  characterized  by  the  material  constants 
e, /i,cr  of  the  dielectric.  Therefore  such  a  surface  represents  an  interface  between  the  open  space 
and  the  dielectric.  In  the  following  the  surfaces  are  supposed  to  be  boundaries  made  from  an 
ideally  conducting  material,  then  there  is  no  field  inside  the  closed  areas.  An  incoming  plane 
wave  is  exciting  these  objects  (Fig.  1).  We  are  looking  for  the  scattered  field. 

The  fields  have  to  satisfy  some  boundary  conditions  on  the  surfaces.  As  the  surfaces  are 
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marginal  faces  made  from  ideally  conducting  material,  there  is 

Tii  X  EQ{r)  =  0  Vr  €  Fj-.  (1) 

Here  rz,  represents  the  normal  vector  on  the  surface  F;,  and  Eq  is  the  sum  of  incoming  field  and 
scattered  field  in  free  space: 


In  order  to  calculate  the  scattered  field  an  unknown  distribution  of  the  current  on  the  surface  Jj 
is  assumed.  The  electromagnetic  field  of  the  (electric)  surface  currents  is  simulating  the  demanded 
scattered  field.  The  electric  field  of  the  current  on  the  surface  F,-  results  from  the  integral 
operator  Ej  as  follows 


Ejjj  =  jujfXo 


+  4W-J;)V 

Kq 


G{R)ds, 


(1) 


with 


G{R)  = 


I  g-jkoR 


47r  R 

being  the  Greens  function  and  ko  being  the  wave  number  of  the  incoming  wave  in  free  space.  Wdh 
these  (equivalent)  surface  currents  the  demanded  scattered  field  is 


=  (2) 

J  =  1 

Combined  with  the  boundary  conditions  of  N  scatterers  there  follows  the  linear  operator  equation 
system 

jii  X  {EiJi  H - h  EnJn)  =  -‘^1  X  Vr  e  Fi, 

;  (3) 

nv  X  {EiJi  +  •  •  •  +  EpjJn)  —  ~'^N  X  Eq  ,  Vr  G  Fyv-  (4) 


To  solve  the  operator  system  the  method  of  moments  is  employed.  For  that  purpose  the  surface 
Fj  is  divided  into  Uj  patches.  For  every  two  patches  with  a  common  edge  a  basis  function 
(roof  top  function)  is  defined  and  the  particular  surface  currents  Jj  are  expanded  to  these  basis 
functions 

i/=i 

Note  that  there  are  n  =  Ejli  ^^sis  functions.  Furthermore,  a  suitable  scalar  product  on 
the  surfaces  is  defined.  The  integration  along  certain  matching  paths  fx  =  1, ...  ,nj  on  the 
surfaces  is  e.g.  such  a  scalar  product: 


with  A  being  any  vector  defined  on  P,.  Using  the  notation 

and 

Z.v  =  (Z^p,  u,  =  [of >, . . . , C/}’“>f ,  Ij  =  [/}’>. . . . ,  /}”'>]’' , 

the  following  linear  equation  system  comes  up 


Zi^ 


( Z\\ 

Z\2 

■  Zl^r  ^ 

(Ux\ 

Z22 

■  Z-2N 

h 

=  - 

U2 

=  —u, 

(6) 

Zpi2  ■  • 

\UnJ 

with  Zij  €  C”'  h,  Ui  €  ,i.j  =  1,  •  •  • ,  Ah  The  diagonal  blocks  Zu  describe  the  interaction  of 
object  i  with  itself  and  Zij  the  interaction  of  the  object  i  with  object  j.  According  to  the  arbitrary 
choice  of  the  domain  of  the  basis  functions  and  the  matching  paths  these  matrices  are  generally 
non-hermitian.  It  follows  that  the  n  x  n  system  matrix  Z  is  also  generally  non-hermitian. 


3  Short  Description  of  GMRES(m) 

Here  we  give  a  very  short  description  of  GMRES  and  GMRES(m);  for  more  detailed  description, 
see  Saad  and  Schultz  [3].  GMRES  is  an  iterative  method  for  solving  linear  systems 

Ax  =  b,  (7) 

with  A  being  a  non-hermitian  n  x  n  matrix,  6  G  C"  being  the  given  right  hand  side,  and  we 
are  looking  for  a  solution  vector  a;  G  C" .  The  method  belongs  to  the  class  of  Krylov  subspace 
methods:  it  produces  approximations  to  A~^b  of  the  form 

G  *0  +  ^kizoi  A),  /c  =  1, 2,  .  .  .  .  (8) 

Here,  xq  G  C’’"'  is  any  initial  guess  for  the  solution  of  (7),  Tq  —  B  —  Axq  is  the  corresponding 
residual  vector,  and  JCki'f'o-,  A)  its  the  k-th.  Krylov  subspace  generated  by  Tq  and  A: 

}Ck{ro,A)  =  span{rQ,  Avo. ....  A^“Vo). 

At  the  A:-th  iteration,  GMRES  computes  a  correction  Zk,  which  minimizes  the  residual  over 
fCkiro.A): 


\\b-Axk\\2=  min  ||6-Aa;||2.  (9) 

Therefore,  the  Arnold!  process  is  to  be  used  to  generate  an  orthonormal  basis  of  the  Krylov 
subspace: 
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Chose  t;i  e  C,  with  ||vi||2  =  1. 
For  j  =  1  to  A: 

(1)  hij  =  (M-.Vj),  z  = 

(2)  Uj+i  =  Avj  —  X)i=i  hijVj 

(3)  hj+ij  =  !i%+ii|2 

(4)  Vj+i  =  Vj+i/hj+ij 


The  vector  recurrences  in  step  (2)  of  the  Arnoldi  process  can  be  rewritten  compactly  in  matrix 
form  as  follows; 


AH  = 


where 

Vk  = 

[Ui,D2,---  ,Vk] 

has  orthonormal  columns,  and 

^hii 

hi2  ■  ■  ■  '' 

^21 

Hk  = 

0 

hn,n.~l  knn 

1 0 

0  fia+l,n/ 

(10) 

(11) 


(12) 


is  an  (ic  T  1)  X  A:  upper  Hessenberg  matrix  of  rank  k. 

If  one  chooses  the  starting  vector  Vi  =  ro/HroU  of  the  Arnoldi  process,  then  all  possible 
iterations  can  be  written  as 

Xk  =  xo-kVkZ^,,  z„€C^  (13) 

Moreover,  with  (10)  and  (13),  the  minimal  residual  property  (9)  reduces  to  the  (A;  +  1)  x  A;  least 
squares  problem 

Wpek  -  HkZk\\2  -  mm\\(3ek  -  Hkz\\2,  (14) 


where  =  [1,0,.  ..,0]^  G  and  jS  =  |]ro||2-  GMRES  is  an  implementation  of  the  minimal 
residual  approach  (9)  that  obtains  the  A:-th  iterative  solution  Xk  by  first  running  k  steps  of  the 
Arnoldi  process  and  then  solving  the  (A:  +  1)  x  A;  least  squares  problem  (14),  which  can  be  solved 
by  a  QR  factorization. 

Since  storage  and  arithmetic  cost  increase  as  the  GMRES  iterations  proceed,  the  algorithm 
usually  implemented  in  practice  is  GMRES (m),  where  m  is  any  maximum  allowable  number  of 
iterations,  typically  m  <^n.  In  one  cycle  of  GMRES (m),  beginning  with  a  current  approximate 
solution  cco,  up  to  m  GMRES  iterations  are  performed;  the  iterations  are  terminated  when  either 
the  residual  norm  has  been  reduced  sufficiently  or  the  full  tn  iterations  have  been  carried  out. 
It  should  be  mentioned  that  the  residual  norm  can  be  monitored  during  the  iterations  without 
actually  computing  the  correction  or  the  residual.  After  the  iterations  have  been  performed,  the 
cycle  is  concluded  with  the  computation  of  the  correction  Zm-  If  Hi’mll  is  not  small  enough,  then 

=  jco  +  Zm  is  used  as  the  new  initial  guess  vector  of  a  new  cycle  of  iterations  as  needed. 
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4  Block  Jacobi  Preconditioning 

We  are  using  GMRES(m)  for  solving  the  linear  system  eq.  (6).  Past  experience  has  shown  that 
for  a  convergence  of  the  method  an  efficient  preconditioning  technique  is  required.  Let  M  be 
a  nonsingular  n  x  n  matrix,  which  approximates  in  some  sense  the  impedance  matrix  Z  of  the 
original  linear  system  (6).  GMRES(m)  is  then  used  to  solve  the  preconditioned  system 

Z'i=-u',  (15) 

where 

Instead  of  solving  (6),  the  equivalent  system  (15)  is  being  solved.  In  this  way  GMRES(m)  gener¬ 
ates  approximate  solutions  of  the  form 

ik  =  io+  JCkirQ.Z'). 

The  interpretation  of  the  impedance  matrix  recommends  a  block  Jacobi  method:  On  the 
surfaces  of  the  bodies  sources  (basis  functions)  are  placed.  Every  row  of  the  impendance  matrix 
then  describes  the  interaction  of  these  sources  with  the  single  reference  points  (test  functions)  on 
the  surfaces.  The  interaction  with  a  source  is  strong,  as  long  as  the  reference  points  lay  on  the 
same  surface,  it  is  weak,  if  the  reference  points  lay  on  different  surfaces.  That  means  that  the 
diagonal  blocks  Za  are  equivalent  to  the  interactions  of  the  sources  with  the  bodies  themselves, 
all  other  blocks  Zij,i  ^  j  describe  the  interactions  of  sources  on  body  i  with  reference  points  on 
body  j.  If  one  neglects  in  a  first  step  the  mutual  interactions  of  the  bodies,  then 

^11 

Z72 

Znn 

generally  approximates  the  impendance  matrix  pretty  well,  as  far  as  the  mutual  interaction  of  the 
bodies  isn’t  too  strong.  This  fact  implies  to  use  M  as  a  preconditioner,  the  single  blocks  of  the 
preconditioner  then  result  from  the  physical  significance  of  the  block  matrices.  M~^  follows  from 
a  simple  LU  decomposition  of  the  single  diagonal  blocks 

Lit/i 

L2U2 

LnUn 

The  block  Jacobi  preconditioning  method  has  the  advantage  of  not  requiring  any  additional  stor¬ 
age,  as  the  diagonal  blocks  are  already  existing  in  the  impendance  matrix,  moreover  they  are  easily 
implemented  and  parallelized.  Furthermore,  the  method  is  extremely  efficient,  as  the  following 
examples  may  demonstrate. 
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5  Numerical  Experiments 

We  have  tested  the  method  by  some  examples.  In  these  examples  the  structures  are  designed 
in  such  a  way  that  galvanically  not  connected  parts  establish  an  overall  configuration.  Then  the 
resulting  equation  system  has  the  form  as  in  eq.  (6)  (arrays  of  PEC  cubes,  ex.  3  and  4,  Fig.  ). 
Moreover,  some  configurations  have  been  tested  that  have  a  not  completely  filled  system^  matrix 
(concentric  dielectric  spheres,  ex.  2)  and  configurations,  whose  analysis  requires  in  addition  to  a 
single-layer  current  distribution  according  to  the  EFIE  also  a  double-layer  current  distribution. 
Treating  these  kinds  of  problems  one  has  to  apply  also  the  magnetic  field  integral  equation  (M  ), 
which  has  not  been  explicitely  introduced  here  (model  of  a  human  being  close  to  a  car,  ex.  5 
(Fig.  2),  and  dielectric  body,  ex.  1).  For  details  see  [2]. 


close  to  a  car 

Each  of  these  examples  can  be  treated  with  a  field  calculation  program  (CONCEPT).  Here  is 
applied  a  special  version  of  the  LU  decomposition,  which  partly  takes  advantage  of  the  population 
structure  (zero  blocks)  for  solving  the  equation  system.  The  arithmetic  times  of  the  direct  method 
are  compared  with  those  of  the  iterative  methods  in  conjunction  with  the  preconditioning  time 
The  following  table  shows  that  in  every  case  the  iterative  method  is  superior  to  the  direct  one.  A 
examples  were  calculated  on  an  IBM  RS6000-580.  Stopping  criterion  for  all  problems  is  j  j  1  <  £, 
with  e  =  10~®. 


Ex. 

n(ni, . . .  ,niv) 

tw 

tpc 

ioMRES 

Ut 

its. 

5 

1 

1056  (528,528) 

66.45 

17.40 

3.96 

21.36 

11 

0.06 

2 

1248  (528,360,360) 

56.13 

14.10 

2.26 

16.36 

6 

0.02 

3 

1500  (5  X  300) 

188.33 

8.01 

2.32 

10.33 

4 

0.02 

4 

2700  (9  X  300) 

1145.68 

14.41 

7.45 

21.86 

3 

0.04 

5 

3386  (1056,1056,1274) 

1035.99 

248.57 

10.85 

259.42 

2 

0.02 

Table  1:  n  number  of  unknowns,  m  number  of  unknows  of  subproblem  tw  arithmetic  time 
for  LU  decomposition  of  the  system  matrix,  tpc  arithmetic  time  to  build  up  the  preconditioner 
M  resp.  tcMRES  arithmetic  time  for  GMRES,  Lt  overall  time  of  the  iterative  method;  all 

times  in  CPU  seconds;  its.  number  of  iterations,  5  maximal  error  in  percent  in  comparison  to  the 
solution  vector  of  the  LU  decomposition. 
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One  has  to  consider  that  the  huge  numerical  effort  is  caused  by  the  LU  decomposition  of  the 
preconditioner.  The  effort  is  reduced,  if  a  great  number  of  partial  problems  has  to  be  solved 
(see  the  PEC  cube  array  example  with  five,  ex.  3,  and  nine  substructures,  ex.  4).  In  addition 
to  that  it  will  be  advantageous,  if  all  partial  problems  have  about  the  same  size.  It  makes  no 
sense  to  try  to  solve  a  problem  by  this  method,  if  the  number  of  unknowns  in  the  partial  problem 
is  only  little  smaller  than  the  overall  number  of  unknowns.  Remarkable  is  the  little  number  of 
required  iterations  of  GMRES.  This  number  is  also  reduced  if  the  number  of  unknowns  rises. 
This  effect  has  a  positive  impact  on  the  arithmetic  time,  as  then  a  lower  number  of  matrix-vector 
multiplications  and  vector-vector  operations  is  required.  Moreover,  the  storage  effort  is  limited, 
because  the  number  of  basis  vectors  that  have  to  be  stored  is  limited,  so  that  the  parameter  m 
can  be  chosen  very  small. 

The  results  show  that  if  a  problem  satisfies  all  preconditions  (a  number  of  galvanically  not 
connected  substructures  of  almost  the  same  size)  this  method  is  evidently  superior  to  the  direct 
method,  even  when  some  of  the  the  zero  blocks  are  utilized. 
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